PCT

INTERNATIONAL APPLICATION PUBLISHE 



WORLD INTELLEC 
In 





(51) International Patent Classification 6:

G01N 33/53, C12Q 1/70, C12N 5/10,
7/01, C07K 7/00, 14/00, A61K 38/03



A1

WO 9603649A1

(43) International Publication Date: 8 February 1996 (08.02.96)

(21) International Application Number: PCT/US95/09382

(22) International Filing Date: 24 July 1995 (24.07.95)



(30) Priority Data:

278,865    22 July 1994 (22.07.94)    US
483,555    7 June 1995 (07.06.95)     US



(71) Applicant: THE UNIVERSITY OF NORTH CAROLINA AT
CHAPEL HILL [US/US]; Office of Technology Development,
CB #4100, 302 Bynum Hall, Chapel Hill, NC 27599-4100 (US).

(72) Inventors: SPARKS, Andrew, B.; 201 Blue Ridge, Carrboro,
NC 27510 (US). KAY, Brian, K.; 18 Wysteria Way, Chapel
Hill, NC 27514 (US). THORN, Judith, M.; Apartment 18-G,
200 Barnes, Carrboro, NC 27510 (US). QUILLIAM,
Lawrence, A.; Apartment 5, 206 Pinegate Circle, Chapel
Hill, NC 27514 (US). DER, Channing, J.; 101 Fieldstone
Court, Chapel Hill, NC 27514 (US).

(74) Agents: MISROCK, S., Leslie et al.; Pennie & Edmonds, 1155
Avenue of the Americas, New York, NY 10036 (US).



(81) Designated States: AM, AU, BB, BG, BR, BY, CA, CN, CZ,
EE, FI, GE, HU, IS, JP, KG, KP, KR, KZ, LK, LR, LT,
LV, MD, MG, MN, MX, NO, NZ, PL, RO, RU, SG, SI,
SK, TJ, TM, TT, UA, UZ, VN, European patent (AT, BE,
CH, DE, DK, ES, FR, GB, GR, IE, IT, LU, MC, NL, PT,
SE), OAPI patent (BF, BJ, CF, CG, CI, CM, GA, GN, ML,
MR, NE, SN, TD, TG), ARIPO patent (KE, MW, SD, SZ,
UG).



Published 

With international search report. 



(54) Title: Src SH3 BINDING PEPTIDES AND METHODS OF ISOLATING AND USING SAME



(57) Abstract 



Peptides having general and specific binding affinities for the Src homology region 3 (SH3) domains of proteins are disclosed in the
present invention. In particular, SH3 binding peptides have been isolated from three phage-displayed random peptide libraries which had
been screened for isolates that bind to bacterial fusion proteins comprising SH3 and glutathione S-transferase (GST). Preferred peptides are
disclosed which comprise a core 7-mer sequence (preferably, a consensus motif) and two or more, preferably at least six, additional amino
acid residues flanking the core sequence, for a total length of 9, preferably at least 13, amino acid residues and no more than about 45
amino acid residues. Such peptides manifest preferential binding affinities for certain SH3 domains. The preferred peptides exhibit specific
binding affinities for the Src-family of proteins. In vitro and in vivo results are presented which demonstrate the biochemical activity of
such peptides.

Src SH3 BINDING PEPTIDES AND
METHODS OF ISOLATING AND USING SAME

1. Field of the Invention

The present invention relates to SH3 binding peptides
having a broad range of binding specificities. That is, certain
members of the SH3 binding peptides disclosed bind with
approximately the same facility to SH3 domains derived from
different SH3 domain-containing proteins. Other members, in
contrast, bind with a much greater degree of affinity to
specific SH3 domains. The SH3 binding peptides are obtained from
phage-displayed random peptide libraries. Methods
are described of obtaining the phage clones that bind to the SH3
domain targets and of determining their relevant nucleotide
sequences and the consequent primary amino acid sequences of the
binding peptides. The resulting SH3 binding proteins are useful
in a number of ways, including, but not limited to, providing a
method of modulating signal transduction pathways at the cellular
level, of modulating oncogenic protein activity or of providing
lead compounds for development of drugs with the ability to
modulate broad classes, as well as specific classes, of proteins
involved in signal transduction.

2. Background of the Invention
2.1. Src and the SH3 Domain

Among a number of proteins involved in eukaryotic
cell signaling, there is a common sequence motif called the SH3
domain. It is 50-70 amino acids in length, moderately conserved
in primary structure, and can be present from one to several
times in a large number of proteins involved in signal
transduction and in cytoskeletal proteins.

The protein pp60c-src represents a family of at least nine
non-receptor protein tyrosine kinases (NR-PTKs). Members of this
family share an overall structural organization comprising a
series of catalytic and non-catalytic domains. In Src, a
14-amino-acid myristylation signal resides at the extreme
amino-terminus, and is followed by a unique region that is not highly
conserved among family members. Following this region are two
highly conserved 60- and 100-amino-acid regions, the Src homology
(SH) domains 3 and 2, respectively. SH2 and SH3 domains have
been shown to play an important role in mediating protein-protein
interactions in a variety of signaling pathways. Koch, C.A., et
al., in Science (1991) 252:668-74. The carboxy-terminal half of
Src contains the PTK catalytic domain, as well as a negative
regulatory tyrosine (Y527) near the carboxy terminus.
Phosphorylation of this residue (e.g., by Csk) results in the
inhibition of PTK activity. Cooper, J.A., et al., in Science
(1986) 231:1431-1434. Mutation of Y527->F generates forms of Src
with increased PTK and oncogenic activity. Cartwright, C.A., et
al., in Cell (1987) 49:83-91; Kmiecik, T.E., et al., in Cell
(1987) 49:65-73; and Piwnica-Worms, H., et al., in Cell (1987)
49:75-82.

The fact that some mutations which result in increased Src
PTK and transforming activity map to the Src SH2 (Seidel-Dugan,
C., et al., in Mol. Cell. Biol. (1992) 12:1835-45; and Hirai, H.
and Varmus, H.E. in Mol. Cell. Biol. (1990) 10:1307-1318) and SH3
domains (Seidel-Dugan, C., et al., supra; Hirai, H. and Varmus,
H.E., supra; Superti-Furga, G., et al., in EMBO J. (1993)
12:2625-34; and Potts, W.M., et al., in Oncogene Res. (1988)
3:343-355) suggests a negative regulatory role for these domains.
That phosphotyrosine residues within specific sequence contexts
represent high affinity ligands for SH2 domains suggests a model
in which the SH2 domain participates in Y527-mediated inhibition
of PTK activity by binding phosphorylated Y527, thereby locking
the kinase domain in an inactive configuration. Matsuda, M.,
Mayer, B.J., et al., in Science (1990) 248:1537-1539. This model
is supported by the observation that phosphopeptides
corresponding to the carboxy-terminal tail of Src bind active,
but not inactive, variants of Src. Roussel, R.R., et al., in
Proc. Natl. Acad. Sci. USA (1991) 88:10696-700; and Liu, X., et
al., in Oncogene (1993) 8:1119-1126.

The mechanism of SH3-mediated inhibition of Src PTK activity
remains unclear. There is evidence that pY527-mediated
inhibition of Src PTK activity involves the SH3 domain as well as
the SH2 domain. Okada, M., Howell, et al., in J. Biol. Chem.
(1993) 268:18070-5; Murphy, S.M., et al., in Mol. Cell. Biol.
(1993) 13:5290-300; and Superti-Furga, G., et al., supra.
Although these effects are thought to be a consequence of
SH3-mediated protein-protein interactions, precisely how the Src SH3
domain exerts its negative regulatory effect is unclear.
Identification of high affinity ligands for the Src SH3 domain
could help resolve these issues.

2.2. Protein Tyrosine Kinases and the Immune Response

Src-related tyrosine kinases are expressed in a variety
of cell types including those of the immune system (lymphocytes,
T cells, B cells, and natural killer cells) and the central
nervous system (neural cells, neurons, oligodendrocytes, parts of
the cerebellum, and the like). Umemori, H. et al., in Brain Res.
Mol. Brain Res. (1992) Dec. 16(3-4):303-310. Their presence in
these cells and tissues and their interaction with specific cell
surface receptors and immunomodulatory proteins (such as T cell
antigen receptor, CD14, CD2, CD4, CD40 or CD45) suggest that
these kinases serve an important role in the signalling pathways
of not only the central nervous system but of the immune system,
as well. See, e.g., Ren, C.L. et al., in J. Exp. Med. (1994)
179(2):673-680 (signal transduction via CD40 involves activation
of Lyn kinase); Donovan, J.A. and Koretzky, G.A., in J. Am. Soc.
Nephrol. (1993) 4(4):976-985 (CD45, the immune response, and
regulation of Lck and Fyn kinases); and Carmo, A.M. et al., in
Eur. J. Immunol. (1993) 23(9):2196-2201 (physical association of
the cytoplasmic domain of CD2 with p56lck and p59fyn).

For instance, mice with disruptions in their Src-like genes,
Hck and Fgr, possess macrophages with impaired phagocytic
activity or exhibit a novel immunodeficiency characterized by an
increased susceptibility to infection with Listeria
monocytogenes. Lowell, C.A. et al., in Genes Dev. (1994)
8(4):387-398. Also, it has been shown that bacterial
lipopolysaccharide (LPS) activates CD14-associated p56lyn,
p58hck, and p59c-fgr, while inducing the production of
lymphokines, such as TNF-alpha, IL-1, IL-6, and IL-8. Inhibition
of the protein tyrosine kinases blocks production of TNF-alpha
and IL-1.

2.3. SH3 Binding Peptides

As mentioned above, it has long been suspected that SH3
domains are sites of protein-protein interaction, but it has been
unclear what SH3 domains actually bind. Efforts to identify
ligands for SH3 domains have led to the characterization of a
number of SH3-binding proteins, including 3BP1 and 2 (Ren, R.,
Mayer, et al., in Science (1993) 259:1157-61), SOS (Olivier,
J.P., et al., in Cell (1993) 73:179-91; and Rozakis-Adcock, M.,
et al., in Nature (1993) 363:83-5), p85 PI-3' Kinase (Xingquan,
L., et al., in Mol. Cell. Biol. (1993) 13:5225-5232), dynamin
(Gout, I., et al., in Cell (1993) 75:25-36), AFAP-110 (Flynn,
D.C., et al., in Mol. Cell. Biol. (1993) 13:7892-7900), and CD42
(Barfod, E.T., et al., in J. Biol. Chem. (1993) 268:26059-26062).
These proteins tend to possess short, proline-rich stretches of
amino acids, some of which have been directly implicated in SH3
binding. A variety of consensus sequences have been proposed,
although the similarity among proline-rich regions of different
SH3-binding proteins tends to be fairly low. Also, attempts to
build consensus sequences are likely complicated by the
incorporation of data from proteins that bind different SH3
domains.

Thus, Cicchetti, P., et al., in Science (1992) 257:803-806,
published their work relating to the isolation and sequencing of
two naturally-occurring proteins that could be bound in vitro by
the SH3 domain of the abl oncogene product. These workers found
that SH3 domains bind short, proline-rich regions of such
proteins. Subsequently, this same group disclosed further
results (Ren, R. et al., supra) in which the SH3 binding sites of
the SH3 binding proteins were localized to "a nine- or ten-amino
acid stretch rich in proline residues." A consensus sequence
incorporating the features of the SH3 binding sites of four SH3
binding proteins was proposed: XPXXPPPΨXP (SEQ ID NO:1), wherein
X indicates a position in the amino acid sequence which is not
conserved among the four SH3 binding proteins, P represents
proline, and Ψ indicates a hydrophobic amino acid residue, such
as P or L.

The screening of complex random peptide libraries has been
used to identify peptide epitopes for monoclonal (Scott, J.K. and
Smith, G.P., in Science (1990) 249:386-390) and polyclonal (Kay,
B.K., et al., in Gene (1993) 128:59-65) antibodies, as well as
peptide ligands for a variety of proteins, including streptavidin
(Devlin, J.J., et al., in Science (1990) 249:404-406; and Lam,
K., et al., in Nature (1991) 354:82-84), the endoplasmic
reticulum chaperone BiP (Blond-Elguindi, S., et al., in Cell
(1993) 75:717-728), and CaM (Dedman, J.R., et al., in J. Biol.
Chem. (1993) 268:23025-23030).

Recently, Chen, J.K. et al., in J. Am. Chem. Soc. (1993)
115:12591-12592, described ligands for the SH3 domain of
phosphatidylinositol 3-kinase (PI-3' Kinase) which were isolated
from a biased combinatorial library. A "biased" library is to be
distinguished from a "random" library in that the amino acid
residues at certain positions of the synthetic peptide are fixed,
i.e., not allowed to vary in a random fashion. Indeed, as stated
by these research workers, screening of a "random" combinatorial
library failed to yield suitable ligands for a PI-3' Kinase SH3
domain probe. The binding affinities of these unsuitable ligands
were described as weak, >100 µM, based on dissociation constants
measured by the Biosensor System (BIAcore).



More recently, Yu, et al. (Yu, H., et al., in Cell (1994)
76:933-945) used a "biased" synthetic peptide library of the form
XXXPPXPXX (SEQ ID NO:2), wherein X represents any amino acid
other than cysteine, to identify a series of peptides which bind
the Src and PI-3' Kinase SH3 domains. The bias was accomplished
by fixing the proline residues at the specific amino acid
positions indicated for the "random" peptide. As stated
previously, without this bias, the technique disclosed fails to
identify any SH3 domain-binding peptides.
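
To make the effect of the bias concrete, the following minimal sketch counts the distinct sequences that a template such as XXXPPXPXX can encode when each X is drawn from the 20 standard amino acids minus cysteine. The function name library_size and the restriction to the 20 standard residues are assumptions made for illustration; they are not taken from the cited work.

```python
# Illustrative arithmetic only: size of a peptide library defined by a template
# in which 'X' marks a variable position and every other letter is fixed.
STANDARD_AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"          # 20 standard residues
ALLOWED_AT_X = STANDARD_AMINO_ACIDS.replace("C", "")   # cysteine excluded: 19


def library_size(template: str, allowed: str = ALLOWED_AT_X) -> int:
    """Return the number of distinct peptides the template can encode."""
    variable_positions = template.count("X")
    return len(allowed) ** variable_positions


biased = library_size("XXXPPXPXX")   # three prolines fixed, six positions vary
unbiased = library_size("X" * 9)     # all nine positions vary

print(biased)               # 19**6 = 47045881
print(unbiased // biased)   # fold reduction from fixing three positions: 19**3 = 6859
```

Fixing the three proline positions therefore shrinks the sequence space of a 9-mer by a factor of 19^3 while guaranteeing that every member carries the PPXP element.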

A consensus sequence, based on 13 binding peptides, was
suggested: RXLPPRPXX (SEQ ID NO:3), where X tends to be a basic
residue (like R, K or H). The binding affinities of several SH3
binding peptides were disclosed as ranging from 8.7 to 30 µM. A
"composite" peptide, RKLPPRPRR (SEQ ID NO:4), was reported to
have a binding affinity of 7.6 µM. This value compares favorably
to the binding affinity of the peptide, VPPPVPPRRR (SEQ ID NO:5),
to the N-terminal SH3 domain of Grb2. See, Kraulis, P.J.,
J. Appl. Crystallogr. (1991) 24:946. Recognizing the limitations of
their technique, Chen and co-workers, supra, stated that their
results "illustrate the utility of biased combinatorial libraries
for ligand discovery in systems where there is some general
knowledge of the ligand-binding characteristics of the receptor"
(emphasis added).

Yu and co-workers, supra, further described an SH3 binding
site consensus sequence, XpΦPpXP (SEQ ID NO:6), wherein X
represents non-conserved residues, Φ represents hydrophobic
residues, P is proline, and p represents residues that tend to be
proline. A consensus motif of RXLPPRPXX (SEQ ID NO:7), where X
represents any amino acid other than cysteine, was proposed for
ligands of PI-3' Kinase SH3 domain. A consensus motif of
RXLPPLPRΦ (SEQ ID NO:8), where Φ represents hydrophobic residues,
was proposed for ligands of Src SH3 domain. Still, the
dissociation constants reported for the 9-mer peptides ranged
only from about 8-70 µM and selectivity between one type of SH3
domain and another was relatively poor, the Kd values differing by only
about a factor of four.

Hence, there remains a need to develop techniques for the
identification of Src SH3 binding peptides which do not rely on
such "biased" combinatorial peptide libraries that are limited to
a partially predetermined set of amino acid sequences. Indeed,
the isolation of SH3 binding peptides from a "random" peptide
library has not been achieved successfully before now.
Furthermore, particular peptides having much greater binding
affinities, whether general or more selective binding for
specific SH3 domains, remain to be identified. Binding peptides
specific for particular SH3 domains are useful, for example, in
modulating the activity of a particular SH3 domain-containing
protein, while leaving others bearing an SH3 domain unaffected.
Still, the more promiscuous general binding peptides are useful
for the modulation of a broad spectrum of SH3 domain-containing
proteins.

The present invention relates to such SH3 binding peptides,
methods for their identification, and compositions comprising
same. In particular, peptides comprising particular sequences of
amino acid residues are disclosed which were isolated from random
peptide libraries. In the present invention, clones were
isolated from a phage-displayed random peptide library which
exhibited strong binding affinities for SH3 domain-containing
protein targets. Some of these protein targets include Abl,
Src, Grb2, PLC-δ, PLC-γ, Ras GAP, Nck, and p85 PI-3' Kinase.
From the nucleotide sequence of the binding phage, the amino acid
sequence of the peptide inserts has been deduced. Synthetic
peptides having the desired amino acid sequences are shown to
bind the SH3 domain of the target proteins. In particular,
synthetic peptides combining a core consensus sequence and
additional amino acid residues flanking the core sequence are
especially effective at binding to particular target protein SH3
domains. The SH3 binding peptides disclosed herein can be
utilized in a number of ways, including the potential modulation
of oncogenic protein activity in vivo. These peptides also serve
as useful leads in the production of peptidomimetic drugs that
modulate a large class of proteins involved in signal
transduction pathways and oncogenesis.

3. Summary of the Invention

Accordingly, three phage-displayed random peptide
libraries were screened for isolates that bind to bacterial
fusion proteins consisting of the Src homology region 3 (SH3) and
glutathione S-transferase (GST). DNA sequencing of the isolates
showed that they contained sequences that resemble the consensus
motif, RPLPPLP (SEQ ID NO:9), within their 8, 22, or 36 amino
acid long random regions. When peptides were synthesized
corresponding to the pIII inserts of the SH3-binding phage, they
bound to the GST fusions of the SH3 domains of Src and the
Src-related proteins, such as Yes, but not of Grb2, Crk, Abl, or
PLCγ1. The synthesized peptides bind quite well to the Src SH3
domain and act as potent competitors of natural Src SH3
interactions in cell lysates. For instance, these peptides can
compete with radiolabelled proteins from cell lysates in binding
to immobilized Src-GST, with an apparent IC50 of 1-10 µM. When a
peptide bearing the consensus sequence RPLPPLP (SEQ ID NO:9) was
injected into Xenopus laevis oocytes, it accelerated the rate of
progesterone-induced maturation. These results demonstrate the
utility of phage-displayed random peptide libraries in
identifying SH3-binding peptide sequences and that such
identified peptides exhibit both in vivo and in vitro biological
activity.

Thus, it is an object of the present invention to provide
peptides having at least nine and up to forty-five amino acid
residues, including an amino acid sequence of the formula,
R-2-L-P-5-6-P-8-9 (SEQ ID NO:10), positioned anywhere along the
peptide, in which each number represents an amino acid residue,
such that 2 represents any amino acid residue except cysteine, 5
and 6 each represents a hydrophobic amino acid residue, 8
represents any amino acid residue except cysteine, and 9
represents a hydrophilic amino acid residue except cysteine, each
letter being the standard one-letter symbol for the corresponding
amino acid, said peptide exhibiting a binding affinity for the
SH3 domain of Src, provided that said peptide is not
R-P-L-P-P-L-P-T-S (SEQ ID NO:11). In a particular embodiment of the present
invention, the peptides also exhibit a binding affinity for the
SH3 domain of Src-related proteins, including Yes, Fyn, Lyn, Lck,
Hck and Fgr.

The present invention also contemplates SH3 domain-binding
peptides that further comprise a C-terminal-flanking amino acid
sequence of the formula 10, 10-11, 10-11-12, 10-11-12-13 (SEQ ID
NO:12) or 10-11-12-13-14 (SEQ ID NO:13), in which each number
represents any amino acid residue except cysteine, such that 10
is bound to 9 by a peptide bond. Furthermore, peptides are also
provided which further comprise an N-terminal-flanking amino acid
sequence of the formula 1', 2'-1', 3'-2'-1' or 4'-3'-2'-1' (SEQ
ID NO:14) in which each number represents any amino acid residue
except cysteine, such that 1' is bound to R by a peptide bond.

Thus, in a particular embodiment, a peptide is disclosed
having at least thirteen and up to forty-five amino acid
residues, including an amino acid sequence of the formula,
3'-2'-1'-R-2-L-P-5-6-P-8-9-10 (SEQ ID NO:15), positioned anywhere along
the peptide, in which each number represents an amino acid
residue, such that 3', 2', 1', 2, 8, and 10 each represents any
amino acid residue except cysteine, 5 and 6 each represents a
hydrophobic amino acid residue, and 9 represents a hydrophilic
amino acid residue except cysteine, each letter being the
standard one-letter symbol for the corresponding amino acid, said
peptide exhibiting a binding affinity for the SH3 domain of Src.

The present invention also seeks to provide new consensus
sequences or motifs that reflect variations in SH3 domain binding
selectivities or specificities. The present invention also
contemplates conjugates of the SH3 binding peptides and a second
molecule or chemical moiety. This second molecule may be any
desired substance whose delivery to the region of the SH3 domain
of a particular protein (or cell containing the protein) is
sought. Possible target cells include, but are not limited to,
neural cells, immune cells (e.g., T cells, B cells, natural
killer cells, and the like), osteoclasts, platelets, epidermal
cells, and the like, which cells express Src, Src-related
proteins, and potentially, other SH3 domain-containing proteins.
In this manner, the modulation of the biological activity of
proteins bearing an SH3 domain can be accomplished.

Other methods and compositions consistent with the
objectives of the present invention are likewise disclosed. In
particular, a method is disclosed of modulating the activity of
Src or Src-related proteins comprising administering a
composition comprising an effective amount of a peptide of the
present invention and a carrier, preferably a pharmaceutically
acceptable carrier. In a specific embodiment, the contemplated
method results in the inhibition of the activity of Src or
Src-related proteins. Alternatively, the method is effective to
activate Src or Src-related proteins.

In yet another embodiment, a method is disclosed of
identifying a peptide having a region that binds to an SH3 domain
comprising: (a) providing an immobilized target protein
comprising an SH3 domain; (b) incubating the immobilized target
protein with an aliquot taken from a random peptide library; (c)
washing unbound library peptides from the immobilized target
protein; (d) recovering the peptide bound to the immobilized
target protein; and (e) determining the primary sequence of the
SH3 domain-binding peptide.
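
Purely as an illustration of the order of steps (a) through (e), the selection can be sketched as a filter over a list of peptide sequences. Everything in the sketch is hypothetical: the function pan_library, the predicate toy_binder (which simply tests for the RPLPPLP consensus) and the non-binding sequence GGGSGGGSGGGS are stand-ins, and real affinity selection is a wet-laboratory procedure that no string test reproduces.

```python
from typing import Callable, Iterable, List


def pan_library(library: Iterable[str],
                binds_target: Callable[[str], bool]) -> List[str]:
    """Toy model of steps (a)-(e); binds_target stands in for the immobilized target."""
    # (a) the immobilized SH3 target is represented by the binds_target predicate
    # (b) incubate: every library member is exposed to the target
    exposed = list(library)
    # (c) wash: peptides that do not bind are discarded
    retained = [peptide for peptide in exposed if binds_target(peptide)]
    # (d) recover the bound peptides; (e) "sequencing" is trivial here because
    #     each peptide is already represented by its amino acid sequence
    return retained


def toy_binder(peptide: str) -> bool:
    """Hypothetical binding rule: the peptide contains the RPLPPLP consensus."""
    return "RPLPPLP" in peptide


# RSTPRPLPPLPTTR is SEQ ID NO:67; GGGSGGGSGGGS is an invented non-binder.
selected = pan_library(["RSTPRPLPPLPTTR", "GGGSGGGSGGGS"], toy_binder)
print(selected)
```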

Moreover, a method is disclosed of imaging cells, tissues,
and organs in which Src or Src-related proteins are expressed,
which comprises administering an effective amount of a
composition comprising an SH3 domain-binding peptide conjugated
to a detectable label or an imaging agent.

Other objectives of the present invention will become
apparent to one of ordinary skill in the art after consideration
of the above disclosure and the following detailed description of
the preferred embodiments.

4. Brief Description of the Figures

FIG. 1 illustrates a scheme for the generation of a
random 36 amino acid peptide library (TSAR-9; e.g., SEQ ID
NO:16). Oligonucleotides were synthesized (SEQ ID NOS:17-18),
converted into double-stranded DNA, cleaved with restriction
enzymes (SEQ ID NOS:19-20), and cloned into the M13 vector, m663.
The random peptide region encoded by the oligonucleotides is
shown in the box (SEQ ID NO:16) and is situated at the N-terminus
of mature protein III (SEQ ID NO:21).

FIG. 2 illustrates a scheme for the generation of a random
22 amino acid peptide library (TSAR-12; e.g., SEQ ID NO:23).
Oligonucleotides were synthesized (SEQ ID NOS:24-25), converted
into double-stranded DNA, cleaved with restriction enzymes (SEQ
ID NOS:26-27), and cloned into the M13 vector, m663. The random
peptide region encoded by the oligonucleotides is shown in the
box (SEQ ID NO:23) and is situated at the N-terminus of mature
protein III (SEQ ID NO:28).

FIG. 3 illustrates a scheme for the generation of a random 8
amino acid peptide library (R8C; SEQ ID NO:30). Oligonucleotides
were synthesized (SEQ ID NOS:31-32), converted into
double-stranded DNA, cleaved with restriction enzymes (SEQ ID NOS:33-34),
and cloned into the M13 vector, m663. The random peptide
region (SEQ ID NO:30) is flanked by cysteine residues and is
situated at the N-terminus of mature protein III (SEQ ID NO:35).

FIG. 4 illustrates the possible origin of one class of
double-insert R8C recombinants (e.g., encoding SEQ ID NO:36).
Double-stranded oligonucleotides (e.g., SEQ ID NO:37) may have
ligated in a head-to-head fashion at the Xba I site prior to
cloning in the Xho I-Xba I cleaved M13 vector.

FIG. 5 shows a list of random peptide recombinants (SEQ ID
NOS:38-61 and 106) isolated by the method of the present
invention and the displayed peptide sequence. The amino acid
sequences are aligned to highlight the core sequences. The
flanking sequences are shown to the N-terminal and C-terminal
ends of the core sequence.

FIG. 6 graphically illustrates the relative binding
affinities of selected phage clones for various SH3 domains. The
results indicate that certain amino acid sequences provide
generic SH3 domain binding, while others can provide greater
selectivity for the SH3 domain of Src. Still other clones
exhibit Src SH3 domain preferential binding.

FIG. 7 shows the binding of synthetic peptides (SEQ ID NOS:9
and 62-70) representing Src SH3-selected phage inserts to Src
SH3-GST fusion target (shaded columns) over background GST
binding (unshaded columns) relative to the core peptide RPLPPLP
(SEQ ID NO:9) and proline-rich peptide segments derived from
naturally occurring proteins. Bound biotinylated peptide was
detected with streptavidin-alkaline phosphatase ELISA. Each
point was performed in triplicate; average absorbance at 405 nm
is presented. Error bars represent SD.

FIG. 8 illustrates the relative specificity of selected
peptides (SEQ ID NOS:9 and 62-70) for SH3 domains derived from
different proteins. In particular, the binding affinities of the
peptides for the SH3 domains of the following protein fusion
targets were tested: Src SH3-GST, Yes SH3-GST, Grb2-GST, Crk
SH3-GST, Abl SH3-GST, PLCγ1 SH2SH3-GST. Bound biotinylated peptide
was detected with streptavidin-alkaline phosphatase. Each point
was performed in triplicate; values are average signal
(absorbance at 405 nm) above GST background, with error bars
representing standard deviation. Hatched bars indicate
saturation of the ELISA signal.

FIG. 9 presents the results of competition experiments in 
which selected peptides were found to inhibit the binding of 
proteins from cell lysates to immobilized Src SH3-GST or Abl SH3- 
GST protein fusion targets. 

FIG. 10 presents a graph illustrating the increased rate of
progesterone-induced maturation of oocytes injected with an SH3
domain-binding peptide, VLKRPLPIPPVTR (SEQ ID NO:64), of the
present invention. Briefly, Stage VI oocytes were prepared and
injected as previously described (see, Kay, B.K., in Methods in
Cell Biol. (1991) 36:663-669). Oocytes were injected with 40 nL
of 100 µM test peptide or water. After injection, the oocytes
were placed in 2 µg/mL progesterone (Sigma, St. Louis, MO) and
scored hourly for germinal vesicle breakdown (GVBD).

FIG. 11 shows the results of fluorescence experiments in
which certain peptides, Panel A = VLKRPLPIPPVTR (SEQ ID NO:64),
Panel B = GILAPPVPPRNTR (SEQ ID NO:63), Panel C = RSTPRPLPPLPTTR
(SEQ ID NO:67), of the invention were shown to localize within
cellular compartments thought to contain Src or Src-related
proteins.

FIG. 12 illustrates a scheme for the generation of a biased
peptide library. Oligonucleotides were synthesized, converted
into double-stranded DNA, cleaved with restriction enzymes XhoI
and XbaI, and cloned into the mBAX vector, described further
below in the Examples section. The biased peptide region is
situated at the N-terminus of mature pIII protein.

FIG. 13 illustrates the peptide sequence encoded in the mBAX 
vector situated at the N-terminus of mature pIII protein. 

5. Detailed Description of the Preferred Embodiments
5.1. General Considerations

The present invention relates to peptides that
exhibit a binding affinity for an SH3 domain, which domain has
been found to be present in a number of physiologically
significant proteins. In particular, peptides are disclosed
which exhibit general binding characteristics to the SH3 domains
found in a group of proteins, including but not limited to Abl,
Src, Grb2, PLC-δ, PLC-γ, Ras GAP, Nck, and p85 PI-3' Kinase.
Preferred peptides exhibit selective, if not specific, binding
affinity for the SH3 domain of Src. As described herein, the
peptides of the present invention include a core sequence,
preferably a consensus sequence, and additional amino acid
residues that flank the core sequence. These peptides, including
the methods for their identification, are described in greater
detail, below.

Thus, in a specific embodiment of the invention, peptides 
are provided which have at least nine and up to cdx)Ut forty- five 
amino acid residues, including an amino acid sequence resembling 
the formula, 

R-2-L-P-5-6-P-8-9 (SEQ ID NO:10), 
positioned anywhere along the peptide. In the above-mentioned 
formula, each number represents an amino acid residue, such that 
2 represents any amino acid residue except cysteine, 5 and 6 each 
represents a hydrophobic amino acid residue, 8 represent? any 
amino acid residue except cysteine, and 9 represents a 
hydrophilic amino acid residue except cysteine. Each letter used 
in the formulas herein represent the standard one -letter symbol 
for the corresponding amino acid. When the peptide is a 9-mer, 
the peptide R-P-L-P-P-L-P-T-S (SEQ ID NO: 11) is excluded. The 
peptides of particular interest are those that exhibit a binding 
affinity for the SH3 domain of Src and Src-related proteins, 
including Yes, Fyn, Lyn, Lck, Hck and Pgr. Preferably, the 
peptides of the invention exhibit a binding affinity for the SH3 
domain of Src, which is at least three-fold, more preferably at 
least four-fold, most preferably at least about five-fold greater 
than that exhibited by the peptide RPLPPLP (SEQ ID NO:9). In
still other embodiments, the peptides exhibit a binding affinity
for the SH3 domain of Src which is at least ten-fold greater than
that exhibited by the peptide RPLPPLP (SEQ ID NO:9).
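
As a rough illustration of the formula above, the following Python sketch scans a peptide for the R-2-L-P-5-6-P-8-9 motif. The residue classes follow the definitions given later in this description (hydrophobic: F, Y, W, V, A, I, L, P, M, with G counted in either class; hydrophilic: K, R, H, D, E, N, Q, T, S, with cysteine excluded). The function name and the use of a regular expression are assumptions made for illustration only and form no part of the disclosed methods.

```python
import re

# Residue classes as defined later in this description. Glycine (G) may be
# counted in either class; cysteine (C) is excluded from the motif positions.
HYDROPHOBIC = "FYWVAILPMG"
HYDROPHILIC = "KRHDENQTSG"
NOT_CYS = "ADEFGHIKLMNPQRSTVWY"  # the 20 standard residues minus C

# R-2-L-P-5-6-P-8-9: positions 2 and 8 are any residue except C,
# positions 5 and 6 are hydrophobic, position 9 is hydrophilic (not C).
MOTIF = re.compile(
    f"R[{NOT_CYS}]LP[{HYDROPHOBIC}]{{2}}P[{NOT_CYS}][{HYDROPHILIC}]"
)

EXCLUDED_9MER = "RPLPPLPTS"  # SEQ ID NO:11 is excluded when the peptide is a 9-mer


def find_motif(peptide):
    """Return the 0-based start of the first motif match, or None.

    Peptides outside the 9 to 45 residue range, and the excluded 9-mer,
    are reported as None.
    """
    if not 9 <= len(peptide) <= 45:
        return None
    if peptide == EXCLUDED_9MER:
        return None
    match = MOTIF.search(peptide)
    return match.start() if match else None
```

For example, `find_motif("RSTPRPLPMLPTTR")` locates the motif starting at the fifth residue (index 4), while the bare 7-mer RPLPPLP is rejected for being shorter than nine residues.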

In specific embodiments, peptides are disclosed in which the 
various amino acid residues at the indicated positions may 
independently have the following preferred identities: 2 is a P, 
R, A, L, Q, E or S, more preferably P or R; 5 represents a P, M,
I or L, more preferably P or M; 6 is a P, L, I or V, more
preferably P or L; 8 is a T, R, P, I, N, E, V, S, A, G or L, more
preferably T or R; and 9 is a T, R, S, H or D, more preferably T
or R. Despite the preference for hydrophobic amino acid residues
at 5 and 6, in some cases it may be desirable to have hydrophilic
amino acid residues at these positions. Specifically, amino acid
residue 5 may be a R or S, and amino acid residue 6 may be a T
or R. Likewise, while a hydrophilic amino acid residue is
preferred at position 9, in some instances a hydrophobic residue,
such as a P or A, may be desirable. 

The present invention also contemplates SH3 domain-binding
peptides with a minimum length of 10, 11, 12, 13, 14, 15 or more
amino acids. Such peptides contain additional amino acid
residues flanking the core sequence of R-2-L-P-5-6-P (SEQ ID
NO:71) either at the C-terminal end, the N-terminal end or both.
Thus, for example, such peptides include those that further
comprise a C-terminal-flanking amino acid sequence of the formula
10, 10-11, 10-11-12, 10-11-12-13 (SEQ ID NO:12) or 10-11-12-13-14
(SEQ ID NO:13), in which each number represents any amino acid
residue except cysteine, such that the amino acid residue 10 is
bound to the amino acid residue 9 by a peptide bond. In that
case, specific embodiments include an amino acid residue 10 which
is T, R, L, S, D, P, A or N, preferably T or R, an amino acid
residue 11 which is R, P, A, Q, S or T, preferably R or P, an
amino acid residue 12 which is P, S, R or T, preferably P or S,
an amino acid residue 13 which is P, S, R, F, H or T, preferably
P or S, and an amino acid residue 14 which is S, R, G or T,
preferably S or R.

Furthermore, peptides are also provided which further
comprise an N-terminal-flanking amino acid sequence of the
formula 1', 2'-1', 3'-2'-1' or 4'-3'-2'-1' (SEQ ID NO:14) in
which each number represents any amino acid residue except
cysteine, such that 1' is bound to R by a peptide bond. In such
a case, specific embodiments are provided in which the amino acid
residue 1' is T, P, S, N, F, W, K, H, Q or G, preferably T or P,
wherein the amino acid residue 2' is S, T, G, P, R, Q, L, A or H,
preferably S or T, wherein the amino acid residue 3' is R, S, P,
G, A, V, Y or L, preferably S or T, and wherein the amino acid
residue 4' is R, S, V, T, G, L or F, preferably R or S.

In a particular embodiment, a peptide is disclosed having at
least thirteen and up to forty-five amino acid residues,
including an amino acid sequence of the formula,

3'-2'-1'-R-2-L-P-5-6-P-8-9-10 (SEQ ID NO:15), positioned anywhere
along the peptide, in which each number represents an amino acid 
residue, such that 3', 2', 1', 2, 8, and 10 each represents any 
amino acid residue except cysteine, 5 and 6 each represents a 
hydrophobic amino acid residue, and 9 represents a hydrophilic 
amino acid residue except cysteine, each letter being the 
standard one-letter symbol for the corresponding amino acid, said
peptide exhibiting a binding affinity for the SH3 domain of Src.
Preferred 13-mers include, but are not limited to, those having
an amino acid residue 5 which is a P or M, an amino acid residue 
1' which is T, P, S or N, an amino acid residue 2' which is S or 
T, an amino acid residue 3' which is R or S, and an amino acid 
residue 10 which is T or R. In all the SH3 domain-binding 
peptides described herein, the prohibition against the use of the 
hydrophilic amino acid residue cysteine (C) does not extend 
beyond the 7-mer "core" sequence and the additional amino acid 
residues flanking the core up to a total (core + flanking) of 
about 20 amino acids. That is, the occasional use of a cysteine 
is not absolutely prohibited. What should be kept in mind is 
that the potential for the formation of intramolecular disulfide 
bonds, to form a cyclic structure, be minimized as much as 
possible. Applicants have found that cyclized structures appear
to be disfavored, at least with potential binding peptides of 
less than about 15 amino acid residues in length. The concern 
for the formation of cyclized structures comprising the core 
sequence diminishes with increasing size of the peptide. 
Presumably, a large enough structure, though cyclic, may allow 
the critical core sequence to adopt a more or less linear 
conformation. 

In particular, specific peptides are disclosed which exhibit 
binding affinities to SH3 domains. These include the peptides, 
RSTPRPLPMLPTTR (SEQ ID NO. 62), RSTPRPLPPLPTTR (SEQ ID NO. 67),
GILAPPVPPRNTR (SEQ ID NO. 63), VLKRPLPIPPVTR (SEQ ID NO. 64),
GPHRRLPPTPATR (SEQ ID NO. 65), and ANPSPATRPLPTR (SEQ ID NO. 66).

Phage clones are also disclosed, along with the amino acid 
sequences that are responsible for SH3 domain binding. These 
phage clones are identified in Figure 5.

In other embodiments of the present invention, SH3 domain- 
binding peptides are contemplated which have a total of 11, 13, 
14, 18, 20, 22, 23, 25, 30, 36, 38 or 45 amino acid residues. 

The peptides of the present invention, having been disclosed
herein, may be prepared by any number of practicable methods,
including but not limited to solution-phase synthesis, solid-phase
synthesis, protein expression by a transformed host,
cleavage from a naturally-derived, synthetic or semi-synthetic
polypeptide, or a combination of these techniques.

The SH3 binding peptides exhibit a wide range of biological
activity which includes the enhancement (or inhibition, depending
on the particular peptide or the nature of the peptide's target
molecule, in this case a protein bearing an SH3 domain) of the
natural function or biological activity of the peptide's target
molecule. For example, the interaction of the binding peptide of
the present invention could result in the modulation of the
oncogenic activity of the target molecule bearing the SH3 domain.
If the target molecule has, in turn, a natural binding partner or
ligand, the peptides of the present invention may also exhibit
antagonistic or agonistic activity in relation to the biological
activity of the natural binding partner.

Thus, it is an object of the present invention to provide a 
method of activating Src or Src-related protein tyrosine kinases 
by administering an effective amount of the SH3 domain-binding 
peptides generally described herein. The intensity of the immune
response can thus be stimulated, for example, by the increased
production of certain lymphokines, such as TNF-alpha and
interleukin-1. As is generally known to those of ordinary skill
in the art, a more intense immune response may be in order in
infection, viral or otherwise, or a malignancy.

Furthermore, in a specific embodiment of the present
invention, a conjugate compound is contemplated which comprises
the peptide of the present invention and a second chemical
moiety. The second chemical moiety can be selected from a wide
variety of chemical compounds including the peptide itself.
Typically, however, the second chemical moiety is selected to be
other than the peptide of the present invention, including but
not limited to an amino acid, a peptide other than an SH3 binding
peptide of the present invention, a polypeptide or protein (i.e.,
the conjugate is a fusion protein), a nucleic acid, a nucleoside,
a glycosidic residue (i.e., any sugar or carbohydrate), a label
or image-enhancing agent (including metals, isotopes,
radioisotopes, chromophores, fluorophores (such as FITC, TRITC,
and the like), and enzyme substrates), a drug (including
synthetic, semisynthetic, and naturally-occurring compounds),
small molecules (e.g., biotin, hormones, factors) and the like.

The peptide of the present invention can be conjugated to
the second chemical moiety either directly (e.g., through
appropriate functional groups, such as an amine or carboxylic
acid group to form, for example, an amine, imine, amide, ester,
acyl or other carbon-carbon bond) or indirectly through the
intermediacy of a linker group (e.g., an aliphatic or aromatic
polyhydroxy, polyamine, polycarboxylic acid, polyolefin or
appropriate combinations thereof). Moreover, the term
"conjugate," as used herein, is also meant to encompass
non-covalent interactions, including but not limited to ionic,
affinity or other complexation interactions. Preferably, such
other non-covalent interactions provide definable, most
preferably, isolatable chemical conjugate species.

As described further herein, the peptides of the present
invention have been shown to localize within certain cellular
compartments which contain Src or Src-related proteins.
Consequently, the above-described conjugate can be utilized as a
delivery system for introduction of a drug to cells, tissues or
organs that include SH3 domain-containing proteins.

It should also be pointed out that the present invention
seeks to provide a recombinant construct comprising a nucleic
acid or its complement that includes codons or nucleotide
sequences encoding a peptide having a region that binds to an SH3
domain, preferably the Src SH3 domain. The recombinant nucleic
acid may be a DNA or RNA polynucleotide.

In a specific embodiment, the present invention contemplates
a recombinant construct which is a transforming vector. Such
vectors include those well known to those of ordinary skill in
the art, which effect the transfer or expression of the
nucleotide sequence after introduction to a host, such as
recombinant plasmid, phage or yeast artificial chromosome. These
vectors may be closed circular loops or they may be linearized.
The vectors contemplated include those that exist
extrachromosomally after host transformation or transfection, as
well as those that integrate within or even displace portions of
the host chromosome. The vectors may be introduced to the cell
with the help of transfection aids or techniques well-known in
the art. For example, these aids or techniques may take the form
of electroporation, use of calcium chloride, calcium phosphate,
DEAE dextran, liposomes or polar lipid reagents known as
LIPOFECTIN or LIPOFECTAMINE. In addition, the present invention
contemplates the direct introduction of the desired nucleic acid
to the host cell, for instance, by injection.

Transformed host cells are also obtained by the methods of
the present invention which are capable of reproducing the
polynucleotide sequences of interest and/or expressing the
corresponding peptide products. A variety of hosts are
contemplated, including prokaryotic and eukaryotic hosts. In
particular, bacterial, viral, yeast, animal, and plant cells are
potentially transformable hosts. Thus, a method is disclosed to
obtain a transformed host cell that can produce, preferably
secrete, a peptide having a region that binds to an SH3 domain
comprising: (a) providing an expression vector, preferably a
secretory expression vector, comprising a nucleotide sequence
encoding at least one copy of a peptide having a region that
binds to an SH3 domain; and (b) introducing the vector to a
competent host cell.

The peptides, thus produced, may then be introduced to
cells, tissues, organs, or administered to the subject for the
purpose of modulating the biochemical activity of the SH3
domain-containing proteins present therein. Accordingly, in specific
embodiments of the present invention, compositions are provided
which comprise an SH3 domain-binding peptide, including a core
sequence and flanking sequences, and a suitable carrier.

The compositions contemplated by the present invention may
also include other components, from those that facilitate the
introduction or administration of the compositions to those that
have their own innate activity, such as a prophylactic, a
diagnostic or a therapeutic action. Such innate activity may be
distinct from that of the peptides of the present invention or be
complementary thereto. In any event, the compositions of the
present invention include those that are suitable for
administration into mammals, including humans. Preferably, the
compositions (including necessarily the carrier) of the present
invention are sterile, though others may need only be
cosmetically, agriculturally or pharmaceutically acceptable.
Still other compositions may be adapted for veterinary use.

The compositions, including the drug delivery systems
described herein, are contemplated to be administered in a
variety of ways, such as parenterally, orally, enterally,
topically or by inhalation. The compositions may also be
administered intranasally, ophthalmically or intravaginally.
Furthermore, the compositions of the invention can take several
forms, such as solids, gels, liquids, aerosols or patches.

In another embodiment of the present invention a method is
provided of identifying a peptide having a region that binds to
an SH3 domain comprising: (a) providing an immobilized target
protein comprising an SH3 domain; (b) incubating the immobilized
target protein with an aliquot taken from a phage-displayed
random peptide library, which library includes peptides having a
random sequence of ≥8 amino acid residues; (c) washing unbound
phage from the immobilized target protein; (d) recovering the
phage bound to the immobilized target protein; and (e)
determining the relevant nucleotide sequence of said binding
phage nucleic acid and deducing the primary sequence
corresponding to the SH3 domain-binding peptide. Preferably, the
method further comprises amplifying the titer of the recovered
phage and repeating the steps of incubation, washing and recovery
to provide SH3 domain-binding peptide-enriched phage.

Any other mode by which the peptide library, random or
otherwise, can be "displayed" can be utilized in the present
invention, however. Moreover, the present applicants believe
that longer random peptide sequences (e.g., >6 amino acid
residues, preferably >10, and most preferably, >12) provide not
only much greater diversity but also a richer degree of secondary
structure conducive to binding activity. If the random region of
the peptide is less than or equal to an 8-mer, it should
preferably not be cyclized.
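
To give a sense of scale for the diversity argument above, the short Python sketch below counts the theoretical number of distinct peptides for a few random-region lengths, assuming all 20 standard amino acids are allowed at every position. The function name is hypothetical, and the figures are upper bounds on sequence space, not measured library sizes.

```python
def sequence_space(n_random, alphabet_size=20):
    """Number of distinct peptides of length n_random over the given alphabet."""
    return alphabet_size ** n_random


# Theoretical sequence space for several random-region lengths.
for n in (6, 8, 10, 12):
    print(f"{n}-mer: {sequence_space(n):.2e} possible sequences")
```

Any real library samples only a fraction of this space once the random region grows beyond a few residues.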

5.2. Preparation of Random Peptide Libraries

The preparation and characterization of the preferred
phage-displayed random peptide libraries have been described
elsewhere. See, for example, Kay, B.K. et al. in Gene (1993)
128:59-65, for a description of the preparation of the
phage-displayed random peptide library known as TSAR-9, more below. In
particular, by cloning degenerate oligonucleotides of fixed
length into bacteriophage vectors, recombinant libraries of
random peptides can be generated which are expressed at the
amino-terminus of the pIII protein on the surface of M13 viral
particles. (There are 3-5 copies of the pIII-fusion on the
surface of each particle.) Phage display offers several
conveniences: first, the expressed peptides are on the surface
of the viral particles and accessible for interactions; second,
the recombinant viral particles are stable (i.e., can be frozen,
exposed to pH extremes); third, the viruses can be amplified; and
fourth, each viral particle contains the DNA encoding the
recombinant genome. Consequently, these libraries can be
screened by isolating viral particles that bind to targets. The
isolates can be grown up overnight, and the displayed peptide
sequence responsible for binding can be deduced by DNA
sequencing.
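
The final step described above, deducing the displayed peptide from the sequenced insert, amounts to translating the coding strand in frame. The following Python sketch shows this with the standard genetic code; the function name and the example DNA string are made up for illustration and are not taken from any clone described herein.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, amino acids listed in TCAG codon order ('*' = stop).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame coding sequence into one-letter amino acids.

    Trailing bases that do not form a complete codon are ignored.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Hypothetical insert encoding the 7-mer core RPLPPLP.
example_insert = "CGTCCGCTGCCTCCATTACCC"
peptide = translate(example_insert)
```

In practice the reading frame is fixed by the known vector sequence on either side of the insert, so only the random region needs to be translated.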

These libraries have approximately >10' different
recombinants, and nucleotide sequencing of the inserts suggests
that the expressed peptides are indeed random in amino acid
sequence. These libraries are referred to herein as TSAR
libraries, where TSAR stands for Totally Synthetic Affinity
Reagents. The preparation of the TSAR libraries is described
further below.

5.3. SH3 Binding Clones And Their Characteristics

Accordingly, peptides have been isolated from an
unconstrained random peptide library which exhibit a binding
affinity for SH3 domains. Furthermore, the binding affinities
exhibited by the disclosed peptides differ in their selectivities
with certain peptides showing comparable binding affinities for
SH3 domains derived from different proteins, while others
manifest greater affinities for specific SH3 domains.

The amino acid sequences of various peptides isolated by the
present method are listed in Figure 5. As can be seen from this
list, certain groups of SH3 domain binding peptides are isolated
from three separate random peptide libraries, each based on a
different type of random peptide insert, all displayed at the
amino-terminus of the pIII protein on the surface of M13 viral
particles. Ten clones were isolated from the R8C library, seven
from the TSAR-12 library, and seven from the TSAR-9 library. The
sequences are presented to highlight the particular amino acid
residues believed to bind directly to the SH3 domain, as well as
to point out the remaining amino acid residues of the random
insert and the viral flanking sequences and complementary site
amino acid residues common to each group of clones. The
frequency with which each particular clone is found in each
library is also indicated in Figure 5. Thus, clones T12.SRC3.1
and T12.SRC3.2 are by far the most abundant clones found among
the three libraries.

Interestingly, all the binding peptides are found to have
the proline-rich amino acid residue motif, which is apparently
responsible for binding, the motif being located predominantly at
the C-terminal end of the insert, although each clone also
contains an insert at the N-terminal end. The significance of
this observation is not presently understood, although this
finding may indicate the possible importance of the C-terminal
viral flanking sequences in SH3 domain binding.

Indeed, a synthetic peptide bearing only the core consensus
sequence RPLPPLP (SEQ ID NO:9) was less effective in binding to
target SH3 domains than synthetic peptides that also included
additional amino acid residues flanking the core sequences.
Thus, 13-mers and 14-mers having the sequences RSTPRPLPMLPTTR
(SEQ ID NO:62), RSTPRPLPPLPTTR (SEQ ID NO:67), GILAPPVPPRNTR (SEQ
ID NO:63), GPHRRLPPTPATR (SEQ ID NO:65), and VLKRPLPIPPVTR (SEQ
ID NO:64) have been prepared and shown to bind to SH3 domains,
such as those of Src and Yes, much more avidly than the 7-mer,
RPLPPLP (SEQ ID NO:9). The 13-mer ANPSPATRPLPTR (SEQ ID NO:66)
has been shown to have binding affinities comparable to the core
consensus sequence. In each case, the 13-mers comprise a 7-mer
"core" sequence plus additional amino acid residues flanking
same, some of which additional amino acid residues are
contributed by the viral flanking sequences.

Thus, in one embodiment of the present invention, a 7-mer
core includes a consensus motif of the formula RXLPφφP (SEQ ID
NO:71), wherein R is arginine, L is leucine, P is proline, X
represents any amino acid except cysteine and φ represents a
hydrophobic amino acid residue. By "hydrophobic amino acid
residue," the applicants mean to include F, Y, V, A, I, L, P
or M, each letter representing the standard one-letter
designation for the corresponding amino acid residue.

Furthermore, a preferred 9-mer peptide comprising two
additional amino acids on the C-terminal end of the core sequence
is envisioned having a consensus motif of the formula RXLPφφPXψ.
In this preferred 9-mer consensus motif, the symbol ψ represents
a hydrophilic amino acid residue, except cysteine. By 
"hydrophilic amino acid residue," the applicants mean to include 
K, R, H, D, E, N, Q, T, S or C, and the other symbols are as 
defined above. For the purposes of the present invention, a 
glycine residue (G) may be considered either a hydrophobic or a 
hydrophilic amino acid residue. The one-letter symbols B and Z, 
which stand for N or D and Q or E, respectively, are considered 
hydrophilic amino acid residues. 

Particular 13-mer peptides of the present invention include 
those listed, below. It is noted, however, that not all the 
following 13-mer peptides correlate strictly to or comply with 
the preferred 9-mer consensus motif, described above. Those 
peptides that do not comply (indicated in italics, with the
non-complying amino acid residues underscored) can, thus, be
described as "resembling" those that do comply (indicated in 
normal type) with the preferred 9-mer consensus motif: 



PGFRELPPLPPSR (SEQ ID NO:72),
AQSRPLPIPPETR (SEQ ID NO:73),
VLKRPLPIPPVTR (SEQ ID NO:64),
PPNSPLPPLPTHL (SEQ ID NO:74),
TGRQPLPPLPNDS (SEQ ID NO:75),
YSTRPYPPIXRES (SEQ ID NO:76),
SHKS.RIjPPLPTRP (SEQ ID NO:77),
YRFRALPSPPSAS (SEQ ID NO:78),
GPHRRLPPTPATR (SEQ ID NO:65),
LAQRQLPPXPGRD (SEQ ID NO:79),
ALQRRLPEIPPEA (SEQ ID NO:80),
PATRPLP2BPSRT (SEQ ID NO:81),
YSTRPLPSBPSRT (SEQ ID NO:82),
XPGRlhLLP&ERR (SEQ ID NO:83),
SGGIL^PVPPRN (SEQ ID NO:84),
RSTRPLPILPRTT (SEQ ID NO:85),
STPRPLPMLPTTR (SEQ ID NO:86),
STNRPLPMIPTTR (SEQ ID NO:87),
RSTRPLPSLPJTT (SEQ ID NO:88),
STSRPLP^LPTTR (SEQ ID NO:89),
RSTRSLPPLPPTT (SEQ ID NO:90),
RSTRQLPIPPTTT (SEQ ID NO:91),
STPRPLPLIPTTP (SEQ ID NO:92),
RSTRPLPPTPLTT (SEQ ID NO:93), and
RSTRPQPPPPITT (SEQ ID NO:94).

Accordingly, other peptides not
specifically disclosed, which either comply with or "resemble"
the preferred 9-mer consensus motif, can be readily envisioned by
those of ordinary skill in the art and are considered to be
equivalent to those that are specifically disclosed above. In
particular, non-compliance at positions 1 (S, G, and I, in place
of R, are tolerated), 3 (V, A, and Q, in place of L, are
tolerated), 4 (L, in place of P, is tolerated), 5 (hydrophilic
amino acid residues, S, R, and T, are tolerated in place of a
hydrophobic amino acid residue), 6 (hydrophilic amino acid
residues, R and T, are tolerated in place of a hydrophobic amino
acid residue), 7 (T and S, in place of P, are tolerated), and 9
(P and A are tolerated in place of a hydrophilic amino acid
residue) have been observed.
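
The position-by-position comparison described above can be sketched in Python as follows. The sketch assumes that a 13-mer is aligned as 3'-2'-1'-R-2-L-P-5-6-P-8-9-10, so that motif position 1 falls on the fourth residue; the function name and this fixed alignment are illustrative assumptions, and peptides whose motif sits elsewhere in the sequence would need to be realigned first.

```python
# Residue classes as used in this description; G may count as either class
# and cysteine (C) is excluded from the hydrophilic motif position.
HYDROPHOBIC = set("FYWVAILPMG")
HYDROPHILIC = set("KRHDENQTSG")


def noncompliant_positions(peptide13):
    """List motif positions (1-9) of a 13-mer that depart from the consensus.

    Assumes the alignment 3'-2'-1'-R-2-L-P-5-6-P-8-9-10, i.e. motif
    position 1 is the fourth residue of the 13-mer.
    """
    if len(peptide13) != 13:
        raise ValueError("expected a 13-residue peptide")
    core = peptide13[3:12]  # motif positions 1 through 9
    checks = {
        1: core[0] == "R",
        2: core[1] != "C",
        3: core[2] == "L",
        4: core[3] == "P",
        5: core[4] in HYDROPHOBIC,
        6: core[5] in HYDROPHOBIC,
        7: core[6] == "P",
        8: core[7] != "C",
        9: core[8] in HYDROPHILIC,
    }
    return [pos for pos, ok in checks.items() if not ok]
```

Under this alignment, PGFRELPPLPPSR complies at every position, GPHRRLPPTPATR departs only at position 6 (T), and YRFRALPSPPSAS departs at positions 5 (S) and 9 (A), in line with the tolerated substitutions listed above.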

5.3.1. Binding Specificities

It has been discovered that certain of the binding
peptides disclosed have a greater relative binding affinity for
one SH3 domain over another. Referring now to Figure 8, the
relative binding affinities of the various peptides described
above toward different SH3 domain targets are graphically
presented. As one can see, the relative binding affinities of
the respective peptides can differ by orders of magnitude. Thus,
as shown in Figure 8, the peptide GPHRRLPPTPATR (SEQ ID NO:65),
having the relevant sequence of the phage clone identified as
T12.SRC3.3, is specific to Src family SH3 domains, including, but
not limited to, Src, Yes, Lck, Hck, Fgr, Fyn, and Lyn. This SH3
binding peptide has little affinity for SH3 domains derived from
PLCγ or Grb2. On the other hand, the peptide GILAPPVPPRNTR (SEQ
ID NO:63), corresponding to the relevant sequence of the phage
clone T12.SRC3.1, which is one of the most abundant binding
clones found by the present method, binds generically to a broad
range of SH3 domains, including Src, PLCγ, and Grb2.

On an intermediate level, the present invention has also
uncovered a peptide, VLKRPLPIPPVTR (SEQ ID NO:64), corresponding
to the relevant sequence of the phage clone T12.SRC3.6, which is
Src preferential; that is, this peptide exhibits strong binding
affinities for members of the Src family, some binding affinities
for Grb2 proteins, but little binding affinities for PLCγ
domains. The peptide ANPSPATRPLPTR (SEQ ID NO:66), corresponding
to the relevant sequence of the phage clone T12.SRC3.2, also
exhibits Src family specificity similar to GPHRRLPPTPATR (SEQ ID
NO:65). The peptides RSTPRPLPMLPTTR (R8C.YES3.5; SEQ ID NO:62)
and RSTPRPLPPLPTTR (representative consensus motif; SEQ ID NO:67)
are highly specific for SH3 domain of Src, Yes, and other
Src-related proteins.

5.4. Further Discussion of Binding Experiments

At the outset it is apparent that the binding affinity
of certain peptides to the SH3 domain of Src and Src-related
proteins is governed by more than just the presence of the
preferred core consensus sequences, RPLPPLP (SEQ ID NO:9) or
RPLPMLP (SEQ ID NO:95; i.e., RPLP(P/M)LP, SEQ ID NO:96). Thus,
while the synthetic peptides RSTPRPLPMLPTTR (R8C.YES3.5; SEQ ID
NO:62) and RSTPRPLPPLPTTR (consensus; SEQ ID NO:67) exhibit a
strong specific binding affinity for Src SH3, the other synthetic
peptides tested also exhibited an avid binding affinity to SH3
domains relative to the 7-mer, RPLPPLP (SEQ ID NO:9). These
other peptides, GILAPPVPPRNTR (SEQ ID NO:63), VLKRPLPIPPVTR (SEQ
ID NO:64), GPHRRLPPTPATR (SEQ ID NO:65), and ANPSPATRPLPTR (SEQ
ID NO:66), sport core sequences and flanking sequences that do
not closely adhere to the preferred core consensus sequences.
Thus, these results suggest that binding affinity to SH3 domains
is governed to a large extent by the nature of the amino acid
residues flanking the core 7-mer sequence.

The binding characteristics of Src SH3-selected peptides were
determined using synthetic biotinylated peptides corresponding to
the sequences displayed by Src SH3-selected phage. These
biotinylated peptides were assayed for direct binding to
immobilized Src SH3-GST. Each of the five library-derived
peptides tested was found to bind to Src SH3-GST and Yes SH3-GST
over background (Figure 8). Furthermore, a strong correlation
was observed between the similarity of a given peptide to the
preferred core consensus sequence RPLP(P/M)LP and the peptide's
affinity for Src SH3-GST. The core sequence of the clone
T12.SRC3.1 (GILAPPVPPRNTR; SEQ ID NO:63) appears to provide more
generic SH3 domain-binding characteristics.

Experiments comparing the relative binding of various phage
clones to SH3 domains taken from a variety of proteins
demonstrated the preference of these clones for Src and
Src-related SH3 domains over SH3 domains taken from other proteins.

It was further found that while the 7-mer having the
consensus sequence RPLPPLP (SEQ ID NO:9) bound to Src SH3-GST
only weakly, peptides comprising the consensus sequence flanked
by residues encoded by one of the Src SH3-selected clones
(R8C.YES3.5), RSTP (SEQ ID NO:97) at the N-terminal end and TTR
at the C-terminal end, bound significantly better than any of the
peptides tested (Figure 7). Thus, as stated previously,
sequences that flank the RPLP(P/M)LP (SEQ ID NO:96) core appear
to be important contributors to SH3 binding. It is further
surmised that a peptide having or resembling the sequence
RSTPAPPVPPRTTR (SEQ ID NO:98) should exhibit strong but generic
binding to a variety of SH3 domains.
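
The way the flanked peptides above are put together can be sketched in a few lines of Python. The helper name is hypothetical. Reading the surmised generic binder as the R8C.YES3.5 flanks placed around the segment APPVPPR from clone T12.SRC3.1 is an inference from the sequences themselves and is not stated in this description.

```python
N_FLANK = "RSTP"  # N-terminal flank from clone R8C.YES3.5 (SEQ ID NO:97)
C_FLANK = "TTR"   # C-terminal flank from the same clone


def flanked(core, n_flank=N_FLANK, c_flank=C_FLANK):
    """Join an N-terminal flank, a 7-mer core and a C-terminal flank."""
    return n_flank + core + c_flank


consensus_14mer = flanked("RPLPPLP")  # SEQ ID NO:67
yes35_14mer = flanked("RPLPMLP")      # SEQ ID NO:62
# Assumed decomposition of SEQ ID NO:98: same flanks around APPVPPR.
generic_14mer = flanked("APPVPPR")
```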

Similarly, it is observed that most of the Src SH3-binding
motifs are located near the carboxy-terminus of the random
peptides, adjacent to sequences which are fixed in every clone
(Figure 5). The exceptional clones tend to possess sequences
that resemble motifs that include fixed flanking sequences. This
clustering contrasts with previous results, in which binding
motifs are distributed throughout the random peptide. Kay, B.K.,
et al., in Gene (1993) 128:59-65.

The binding of the library-derived Src SH3-binding peptides
was compared to that of peptides corresponding to proline-rich
regions of natural proteins. Peptides corresponding to
SH3-binding regions in human PI-3' Kinase (KISPPTPKPRPPRPLPV; SEQ ID
NO:69) and human SOS1.20 (GTVEPVPPPVPPRRRPESA; SEQ ID NO:68), as
well as a proline-rich region of the cytoskeletal protein
vinculin (LAPPKPPLPEGEV; SEQ ID NO:70), bound Src SH3 less well
than the library-derived peptides (Figure 7).

As mentioned above, the relative specificity of binding was
explored. Thus, the relative binding of Src SH3-selected
peptides to equal amounts of GST fusions to SH3 domains from
different proteins was determined (Figure 8). While all of the
library-derived peptides bound the Src and Yes SH3 domains almost
equally well, none of the peptides (with the exception of peptide
T12.SRC3.1, the most divergent peptide tested) bound the SH3
domains of Grb2, Crk, Abl or PLCγ1 appreciably. Thus, the
library-derived peptides, in contrast with a peptide derived from
SOS1, exhibit SH3 binding that is relatively specific for
Src-family members.

Next, it was determined whether the binding to the Src SH3
domain was qualitatively like the interactions of the SH3 domain
and natural proteins found in cell lysates. Thus, radiolabeled
proteins were prepared from NIH 3T3 cell lysates and
chromatographed over Src SH3-GST immobilized on
glutathione-linked Sepharose. SDS-PAGE shows that a number of proteins can
be affinity purified in this manner. The synthesized peptides
bind quite well to the Src SH3 domain, as they can compete the
binding of radiolabeled proteins from cell lysates to immobilized
Src-GST, with an IC50 of 1-10 mM (Figure 9). In conclusion, the
peptides can efficiently block the interaction of cellular
proteins with Src SH3 in vitro.

Moreover, Xenopus laevis oocytes injected with mRNA encoding
constitutively active Src undergo progesterone-induced maturation
at an accelerated rate relative to oocytes injected with water or
c-Src mRNA. Unger, T.F. and Steele, R.E. in Mol. Cell. Biol.
(1992) 12:5485-5498. To explore the ability of the
library-derived Src SH3-binding peptides to exert a biochemical effect in
vivo, the influence of the peptides on the maturation of Xenopus
laevis oocytes was examined. Hence, stage VI oocytes were
injected with peptide, exposed to progesterone, and scored for
germinal vesicle breakdown. Figure 10 shows that the rate of
maturation was accelerated by approximately one hour when oocytes
were injected with the SH3-binding peptide consisting of RPLPPLP
(SEQ ID NO:9) flanked by residues from clone T12.SRC3.6
(VLKRPLPIPPVTR; SEQ ID NO:64), but not with water or a peptide
corresponding to a proline-rich segment of vinculin
(LAPPKPPLPEGEV; SEQ ID NO:70) as controls. The magnitude of this
effect is roughly equivalent to that seen with injection of mRNA
encoding constitutively active Src. See, e.g., Figure 3B in
Unger, T.F. and Steele, R.E., supra. This result suggests that
the library-derived Src SH3-binding peptide is effectively
relieving an inhibitory effect of the Src SH3 domain upon Src PTK
activity. This model is consistent with a number of studies
which have demonstrated an inhibitory effect of the Src SH3
domain upon Src kinase and transforming activity. See, e.g.,
Okada, M., et al., supra; Murphy, S.M., et al., supra; and
Superti-Furga, G., et al., supra.

5.5. Diagnostic And Therapeutic Agents
Based On SH3 Binding Peptides and Additional Methods
of Their Use

As already indicated above, the present invention also 
seeks to provide diagnostic, prophylactic, and therapeutic agents 
based on the SH3 binding peptides described herein. 

In one embodiment, diagnostic agents are provided,
preferably in the form of kits, comprising an SH3 domain-binding
peptide and a detectable label conjugated to said peptide
directly, indirectly or by complexation, said peptide comprising:
(i) a core sequence motif of the formula RXLPφφP (SEQ ID NO:71),
wherein X represents any amino acid except cysteine and φ
represents a hydrophobic amino acid residue, including F, Y, W,
V, A, I, L, P, M or G, each letter representing the standard one-
letter designation for the corresponding amino acid residue; and
(ii) two or more additional amino acid residues flanking said
core sequence at its C-terminal end, N-terminal end or both.
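
The core motif recited above (R, then any residue except cysteine, then L and P, then two hydrophobic residues drawn from F, Y, W, V, A, I, L, P, M or G, then P) lends itself to a simple pattern search over candidate sequences. The following Python sketch is illustrative only: the function name find_core_motif, the constant names, and the treatment of the flanking requirement as a plain count of residues outside the core are assumptions made for the example and are not part of the disclosure.

```python
import re

# Hydrophobic residues listed in the text for the two motif positions
HYDROPHOBIC = "FYWVAILPMG"
# The 20 standard amino acids minus cysteine (the "X" position)
ANY_BUT_CYS = "ADEFGHIKLMNPQRSTVWY"

# R, X (not C), L, P, two hydrophobic residues, P
CORE_MOTIF = re.compile(rf"R[{ANY_BUT_CYS}]LP[{HYDROPHOBIC}]{{2}}P")


def find_core_motif(peptide, min_flank=2):
    """Return (start, core) for the first core motif that has at least
    min_flank residues outside the core (N-terminal, C-terminal or both),
    or None if no such motif is found."""
    seq = peptide.upper()
    for match in CORE_MOTIF.finditer(seq):
        flank = match.start() + (len(seq) - match.end())
        if flank >= min_flank:
            return match.start(), match.group()
    return None


# Peptides quoted in the text: the T12.SRC3.6-derived peptide and the
# vinculin control
print(find_core_motif("VLKRPLPIPPVTR"))
print(find_core_motif("LAPPKPPLPEGEV"))
```

Under these assumptions the T12.SRC3.6-derived peptide contains a core with flanking residues on both sides, whereas the vinculin segment, which lacks the leading arginine, does not match.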

The diagnostic agents of the present invention can be used 
to detect the presence of SH3 domains of a generic or specific 
type in cells, tissues or organs either in vitro or in vivo. For
in vivo applications, the diagnostic agent is preferably mixed 
with a pharmaceutically acceptable carrier for administration, 
either enterally, parenterally or by some other route dictated by 
the needs of the particular application. 

In a particular embodiment, for example, an assay based on a
fusion product is contemplated which comprises a Src SH3 domain-
binding peptide of the invention and a substrate for deregulated
or "activated" Src. For instance, a muscle biopsy, taken from a
subject suspected of being infected by the Rous sarcoma virus,
can be treated with an effective amount of the fusion product.
By subsequent analysis of the degree of conversion of the
substrate, one can potentially detect infection by the Rous
sarcoma virus in the subject, particularly mammals, especially
chickens. The presence of the retrovirus, which causes the
expression of deregulated or "activated" Src, may thus be
indicated by unusually high levels of Src as revealed by large
amounts of the converted substrate. See, for example, Paxton,
W.G. et al., in Biochem. Biophys. Res. Commun. (1994) 200(1):260-
267 (detection of phosphorylated tyrosine and serine residues of
angiotensin II AT1 receptor, a substrate of Src family tyrosine
kinases); another suitable substrate may be the protein p68
(Fumagalli, S. et al., in Nature (1994) 368(6474):871-874;
Taylor, S.J. and Shalloway, D., in ibid. at 867-871).

Alternatively, the enzyme can be isolated by selective
binding to a form of the SH3 domain-binding peptides of the
present invention (e.g., biotin-peptide conjugate). After
isolation of the protein-peptide conjugate complex (e.g., on a
column comprising streptavidin), the activity of the enzyme can
then be assayed by conventional methods to determine its level of
protein kinase activity which can be taken as an indication of
the presence of the deregulated or "activated" form of the
enzyme. An assay for Src kinase has been described by Klinz and
Maness, in Neuroprotocols (a companion to Neuroscience) (1992)
1(3):224-231.

Moreover, the diagnostic agents of the invention can also 
serve as imaging agents of cells, tissues or organs, especially 
those that contain proteins with an SH3 domain. For example, 
neural cells (e.g., neurons, other areas of the brain), 
osteoclasts, osteoblasts, platelets, immune cells, and other 
dividing cells are known to express or contain proteins with SH3 
domains. Thus, an image can be taken of portions of the body to 
serve as a baseline for subsequent images to detect physiologic 
or biochemical changes in the subject's body. For instance,
changes in the condition of cellular levels of Src or a
transformation of the cellular Src to an "activated" form may be
detected using the diagnostic or imaging agents of the present 
invention. 

Accordingly, it has been demonstrated that an SH3-binding
peptide tagged with a fluorescence emitter can provide an image
of the cytoskeleton. The images are presented in Figure 11. As
can be seen from Figure 11, panels A, B, and C show the
fluorescence image that is obtained on treating NIH 3T3
fibroblasts with SH3 domain-binding peptides modified to include
a fluorescent tag. In sharp contrast, panel D shows only a dark
image that is produced when the cells are treated with a proline-
rich segment of vinculin as a control.

In another embodiment, an SH3 domain-binding peptide-
horseradish immunoperoxidase complex or related
immunohistochemical agent could be used to detect and quantitate
specific receptor molecules in tissues, serum or body fluids. In
particular, the present invention provides useful diagnostic
reagents for use in immunoassays, Southern or Northern
hybridization, and in situ assays. Accordingly, the diagnostic
agents described herein may be suitable for use in vitro or in
vivo.

In addition, the diagnostic or imaging agent of the present
invention is not limited by the nature of the detectable label.
Hence, the diagnostic agent may contain one or more such labels
including, but not limited to, radioisotopes, fluorescent tags,
paramagnetic substances, heavy metals, or other image-enhancing
agents. Those of ordinary skill in the art would be familiar
with the range of labels and methods to incorporate or conjugate
them into the SH3 domain-binding peptide to form diagnostic
agents.

In yet a further embodiment, pharmaceutical compositions are 
provided comprising an SH3 domain-binding peptide and a 
pharmaceutically acceptable carrier. In a specific embodiment of 
the invention, the pharmaceutical composition is useful for the 
modulation of the activity of SH3 domain-containing proteins. By
"modulation" is meant either inhibition or enhancement of the
activity of the protein target. Accordingly, a pharmaceutical
composition is disclosed comprising an SH3 domain-binding peptide
and a pharmaceutically acceptable carrier, said peptide
comprising: (i) a 9-mer sequence motif of the formula RXLPφφPXψ
(SEQ ID NO: 10), wherein X represents any amino acid except
cysteine, φ represents a hydrophobic amino acid residue, and
wherein ψ is a hydrophilic amino acid residue except cysteine,
each letter representing the standard one-letter designation for
the corresponding amino acid residue; and, optionally, (ii)
additional amino acid residues flanking the 9-mer sequence at its
C-terminal end, N-terminal end or both, up to a total of 45 amino
acid residues, including said 9-mer sequence. Preferably, the 
peptide comprises at least one, more preferably at least two, and 
most preferably at least three additional amino acids flanking 
the 9-mer sequence. 

As stated above, the therapeutic or diagnostic agents of the 
invention may also contain appropriate pharmaceutically 
acceptable carriers, diluents and adjuvants. Such pharmaceutical 
carriers can be sterile liquids, such as water and oils including 
those of petroleum, animal, vegetable or synthetic origin, such
as peanut oil, soybean oil, mineral oil, sesame oil and the like.
Water is a preferred carrier when the pharmaceutical composition
is administered intravenously. Saline solutions and aqueous
dextrose and glycerol solutions can also be employed as liquid
carriers, particularly for injectable solutions. Suitable
pharmaceutical excipients include starch, glucose, lactose,
sucrose, gelatin, malt, rice, flour, chalk, silica gel, magnesium
carbonate, magnesium stearate, sodium stearate, glycerol
monostearate, talc, sodium chloride, dried skim milk, glycerol,
propylene glycol, water, ethanol and the like. These
compositions can take the form of solutions, suspensions,
tablets, pills, capsules, powders, sustained-release formulations
and the like. Suitable pharmaceutical carriers are described in
"Remington's Pharmaceutical Sciences" by E.W. Martin.

Such compositions will contain an effective therapeutic
amount of the active compound together with a suitable amount of
carrier so as to provide the form for proper administration to
the subject. While intravenous injection is a very effective
form of administration, other modes can be employed, including
but not limited to intramuscular, intraperitoneal, and
subcutaneous injection, and oral, nasal, enteral, and parenteral
administration.

The therapeutic agents and diagnostic agents of the instant
invention are used for the treatment and/or diagnosis of animals,
and more preferably, mammals including humans, as well as dogs,
cats, horses, cows, pigs, guinea pigs, mice and rats.
Accordingly, other methods contemplated in the present invention
include, but are not limited to, a method of modulating, i.e.,
inhibiting or enhancing, bone resorption in a mammal (see, e.g.,
Hall, T.J., in Biochem. Biophys. Res. Commun. (1994) 199(3):1237-
44), a method of disrupting protein tyrosine kinase-mediated
signal transduction pathways or a method of regulating the
processing, trafficking or translation of RNA in a cell by
introducing or administering an effective amount of an SH3
Taylor, S.J. and Shalloway, D., supra).

The diagnostic or therapeutic agents of the present
invention can be modified by attachment to soluble macromolecules
such as proteins, polysaccharides, or synthetic polymers. For
example, the peptide could be coupled to styrene-maleic acid
copolymers (see, e.g., Matsumura and Maeda, Cancer Res. (1986)
46:6387), methacrylamide copolymers (Kopecek and Duncan, J.
Controlled Release (1987) 6:315), or polyethylene glycol (PEG)
(e.g., Hershfield and Buckley, N. Engl. J. Med. (1987) 316:589;
Ho et al., Drug Metab. Dispos. (1986) 14:349; Chua et al., Ann.
Intern. Med. (1988) 109:114). The agents, if desired, are
further targeted by attachment to an antibody, especially a
monoclonal antibody. Such antibodies include but are not limited
to chimeric, single chain, Fab fragments, and Fab expression
libraries. In one embodiment the agent is coupled to the
macromolecule via a degradable linkage so that it will be
released in vivo in its active form.

In another embodiment, the therapeutic or diagnostic agent
may be delivered in a vesicle, in particular a liposome. See,
Langer, Science (1990) 249:1527-1533; Treat et al., in Liposomes
in the Therapy of Infectious Disease and Cancer, Lopez-Berestein
and Fidler (eds.), Liss, New York (1989) pp. 353-365; Lopez-
Berestein, ibid., pp. 317-327.

In yet another embodiment, the therapeutic or in vivo
diagnostic agent can be delivered in a controlled release system.
In one embodiment, a pump may be used (see Langer, supra; Sefton,
CRC Crit. Ref. Biomed. Eng. (1987) 14:201; Buchwald et al.,
Surgery (1980) 88:507; Saudek et al., N. Engl. J. Med. (1989)
321:574). In another embodiment, polymeric materials may be used
(see Medical Applications of Controlled Release, Langer and Wise
(eds.), CRC Press, Boca Raton, Florida, 1974; Controlled Drug
Bioavailability, Drug Product Design and Performance, Smolen and
Ball (eds.), Wiley, New York 1984; Ranger and Peppas, J. Macromol.
Sci. Rev. Macromol. Chem. (1983) 23:61; see also Levy et al.,
Science (1985) 228:190; During et al., Ann. Neurol. (1989)
25:351; Howard et al., J. Neurosurg. (1989) 71:105). In a
preferred embodiment, a controlled release system may be placed
next to the therapeutic target, thus requiring only a fraction of
the systemic dose (see, e.g., Goodson, in Medical Applications of
Controlled Release, supra, (1984) 2:115-138). It will be
recognized by one of ordinary skill in the art that a particular
advantage of the invention is that a peptide will not be subject
to the problems of denaturation and aggregation associated with
proteins held in the warm, moist environment of a body in a
controlled release system.

Other controlled release systems are discussed in the review 
by Langer, in Science (1990) 249:1527-1533.

6. EXAMPLES

6.1. Preparation of the TSAR-9 Library

6.1.1. Synthesis and Assembly of
Oligonucleotides

Figure 1 shows the formula of the
oligonucleotides and the assembly scheme used in construction of
the TSAR-9 library. The oligonucleotides were synthesized with
an Applied Biosystems 380a synthesizer (Foster City, CA), and the
full-length oligonucleotides were purified by HPLC.

Five micrograms of each of the pair of oligonucleotides were
mixed together in buffer (10 mM Tris-HCl, pH 8.3, 15 mM KCl,
0.001% gelatin, 1.5 mM magnesium chloride), with 0.1% Triton
X-100, 2 mM dNTPs, and 20 units of Taq DNA polymerase. The
assembly reaction mixtures were incubated at 72 °C for 30 seconds
and then 30 °C for 30 seconds; this cycle was repeated 60 times.
It should be noted that the assembly reaction is not PCR, since a
denaturation step was not used. Fill-in reactions were carried
out in a thermal cycling device (Ericomp, La Jolla, CA) with the
following protocol: 30 seconds at 72 °C, 30 seconds at 30 °C,
repeated for 60 cycles. The lower temperature allows for
annealing of the six base complementary region between the two
sets of the oligonucleotide pairs. The reaction products were
phenol/chloroform extracted and ethanol precipitated. Greater
than 90% of the nucleotides were found to have been converted to
double stranded synthetic oligonucleotides.

After resuspension in 300 µL of buffer containing 10 mM
Tris-HCl, pH 7.5, 1 mM EDTA (TE buffer), the ends of the
oligonucleotide fragments were cleaved with Xba I and Xho I (New
England BioLabs, Beverly, MA) according to the supplier's
recommendations. The fragments were purified by 4% agarose gel
electrophoresis. The band of correct size was removed and
electroeluted, concentrated by ethanol precipitation and
resuspended in 100 µL TE buffer. Approximately 5% of the
assembled oligonucleotides can be expected to have internal Xho I
or Xba I sites; however, only the full-length molecules were used
in the ligation step of the assembly scheme. The concentration
of the synthetic oligonucleotide fragments was estimated by
comparing the intensity on an ethidium bromide stained gel run
along with appropriate quantitated markers. All DNA
manipulations not described in detail were performed according to
Maniatis, supra.

To demonstrate that the assembled, enzyme-digested
oligonucleotides could be ligated, the synthesized DNA fragments
were examined for their ability to self-ligate. The digested
fragments were incubated overnight at 18 °C in ligation buffer
with T4 DNA ligase. When the ligation products were examined by
agarose gel electrophoresis, a concatamer of bands was visible
upon ethidium bromide staining. As many as five different unit
length concatamer bands (i.e., dimer, trimer, tetramer, pentamer,
hexamer) were evident, suggesting that the synthesized DNA
fragments were efficient substrates for ligation.

6.1.2. Construction of Vectors 

The construction of the M13 derived phage vectors
useful for expressing a TSAR library has been recently described
(Fowlkes, D. et al. BioTech. (1992) 13:422-427). To express the
TSAR-9 library, an M13 derived vector, m663, was constructed as
described in Fowlkes. The m663 vector contains the pIII gene
having a c-myc epitope, i.e., as a stuffer fragment, introduced
at the mature N-terminal end, flanked by Xho I and Xba I
restriction sites (see also Figure 1 of Fowlkes).

6.1.3. Expression of the TSAR-9 Library

The synthesized oligonucleotides were then ligated
to Xho I and Xba I double-digested m663 RF DNA containing the
pIII gene (Fowlkes) by incubation with ligase overnight at 12 °C.
More particularly, 50 ng of vector DNA and 5 ng of the digested
synthesized DNA were mixed together in 50 µL ligation buffer
(50 mM Tris, pH 8.0, 10 mM MgCl2, 20 mM DTT, 0.1 mM ATP) with T4
DNA ligase. After overnight ligation at 12 °C, the DNA was
concentrated by ethanol precipitation and washed with 70%
ethanol. The ligated DNA was then introduced into E. coli
(DH5αF'; GIBCO BRL, Gaithersburg, MD) by electroporation.

A small aliquot of the electroporated cells was plated and
the number of plaques counted to determine that 10* recombinants
were generated. The library of E. coli cells containing
recombinant vectors was plated at a high density (~400,000 per
150 mm petri plate) for a single amplification of the recombinant
phage. After 8 hr, the recombinant bacteriophage were recovered
by washing each plate for 18 hr with SMG buffer (100 mM NaCl, 10
mM Tris-HCl, pH 7.5, 10 mM MgCl2, 0.05% gelatin) and after the
addition of glycerol to 50% were frozen at -80 °C. The TSAR-9
library thus formed had a working titer of ~2 x 10^^ pfu/mL.

6.2. Preparation of TSAR-12 Library

Figure 2 shows the formula for the synthetic
oligonucleotides and the assembly scheme used in the construction
of the TSAR-12 library. As shown in Figure 2, the TSAR-12
library was prepared substantially the same as the TSAR-9 library
described in Section 6.1 above with the following exceptions:
(1) each of the variant non-predicted oligonucleotide sequences,
i.e., NNB, was 30 nucleotides in length, rather than 54
nucleotides; (2) the restriction sites included at the 5' termini
of the variant, non-predicted sequences were Sal I and Spe I,
rather than Xho I and Xba I; and (3) the invariant sequence at
the 3' termini to aid annealing of the two strands was GCGGTG and
CGCCAC rather than CCAGGT and GGTCCA (5' to 3').

After synthesis including numerous rounds of annealing and
chain extension in the presence of dNTPs and Taq DNA polymerase,
and purification as described above in Section 6.1.1, the
synthetic double stranded oligonucleotide fragments were
digested with Sal I and Spe I restriction enzymes and ligated
with T4 DNA ligase to the nucleotide sequence encoding the M13
pIII gene contained in the m663 vector to yield a library of
TSAR-expression vectors as described in Sections 6.1.2 and 6.1.3.
The ligated DNA was then introduced into E. coli (DH5αF'; GIBCO
BRL, Gaithersburg, MD) by electroporation. The library of E. coli
cells was plated at high density (~400,000 per 150 mm petri
plate) for amplification of the recombinant phage. After about 8
hr, the recombinant bacteriophage were recovered by washing for
18 hr with SMG buffer and after the addition of glycerol to 50%
were frozen at -80 °C.

The TSAR-12 library thus formed had a working titer of ~2 x
10'^ pfu/mL.

6.3. Characterization of the TSAR-9 and -12 Libraries

The inserted synthetic oligonucleotides for each of the 
TSAR libraries, described in Sections 6.1 and 6.2 above, had a 
potential coding complexity of 20^36 (~10^47) and 20^20, respectively,
and since -10^* molecules were used in each transformation 

experiment, each member of these TSAR libraries should be unique.
After plate amplification the library solution or stock has 10* 
copies of each member/mL.
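
The potential coding complexities quoted above follow from the number of randomized codons per insert. The Python sketch below is a rough check only. It assumes that each library insert is assembled from two oligonucleotides, that the randomized NNB stretch is 54 nucleotides per oligonucleotide for TSAR-9 and 30 for TSAR-12 (as stated in Section 6.2), and that each randomized codon can encode any of the 20 amino acids; the function name is hypothetical.

```python
import math


def peptide_complexity(random_nt_per_oligo, n_oligos=2, alphabet=20):
    """Return (number of randomized codons, number of possible peptides)."""
    codons = (random_nt_per_oligo // 3) * n_oligos
    return codons, alphabet ** codons


for name, nt in (("TSAR-9", 54), ("TSAR-12", 30)):
    codons, total = peptide_complexity(nt)
    # Report the complexity as a power of ten for easy comparison
    print(f"{name}: {codons} codons, 20^{codons} is about 10^{math.log10(total):.1f}")
```

Under these assumptions TSAR-9 has 36 randomized positions and TSAR-12 has 20, so the number of possible sequences vastly exceeds the number of molecules that can be transformed in any one experiment.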

It was observed that very few (<10%) of the inserted
oligonucleotide sequences characterized so far in both of the
libraries have exhibited deletions or insertions. This is likely
a reflection of the accuracy of assembling the oligonucleotides
under the conditions used and the fact that certain types of
mutations (i.e., frame-shifts) would not be tolerated, as pIII is
an essential protein for phage propagation.

In order to determine whether any coding bias existed in
the variant non-predicted peptides expressed by these libraries,
perhaps due to biases imposed in vitro during synthesis of the
oligonucleotides or in vivo during expression by the reproducing
phage, inserts were sequenced as set forth below.

6.3.1. Characterization of TSAR-9 Library

Inserted synthetic oligonucleotide fragments of 23
randomly chosen isolates were examined from the TSAR-9 library.
Individual plaques were used to inoculate 1 mL of 2xYT broth
containing E. coli (DH5αF') cells and the cultures were allowed
to grow overnight at 37 °C with aeration. DNA was isolated from
the culture supernatants according to Maniatis, supra. Twenty-
three individual isolates were sequenced according to the method
of Sanger (Proc. Natl. Acad. Sci. USA (1977) 74:5463-5467) using
as a primer the oligonucleotide 5'-AGCGTAACGATCTCCCG (SEQ ID NO:
99), which is 89 nucleotides downstream of the pIII gene cloning
site of the m663 vector used to express the TSARs.

Nucleotide sequences and their encoded amino acid sequences
were analyzed with the MacVector computer program (IBI, New
Haven, CT). The Microsoft EXCEL program was used to evaluate
amino acid frequencies. Such analyses showed that the nucleotide
codons, and hence most amino acids, occurred at the
expected frequency in the TSAR-9 library of expressed proteins.
The notable exceptions were glutamine and tryptophan, which were over-
and under-represented, respectively.

It is of interest to note the paucity of TAG stop codons in
the inserts, i.e., only 2 of ~200 isolates characterized
contained a TAG stop codon. About half [1-(47/48)^36] of the
phage inserts were expected to have at least one TAG codon in
view of the assembly scheme used. However, most of the TAG-
bearing phage appear to have been lost from the library, even
though the bacterial host was supE. This may be a consequence of
suppression being less than 100% effective.
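
The expected fraction of inserts carrying at least one TAG codon can be checked with a short calculation. The Python sketch below assumes NNB codons (N = A, C, G or T; B = C, G or T under the IUPAC code, giving 48 possible codons of which only TAG is a stop) and 36 randomized codons per TSAR-9 insert. Both figures are inferred from the assembly scheme rather than restated at this point in the text, and the function name is hypothetical.

```python
from itertools import product


def stop_fraction(n_codons, third_base="CGT", stops=("TAG", "TAA", "TGA")):
    """Probability that at least one of n_codons random codons is a stop.

    Codons are drawn uniformly from N-N-(third_base); with third_base
    "CGT" (IUPAC B) only TAG can occur among the three stop codons.
    """
    codons = ["".join(c) for c in product("ACGT", "ACGT", third_base)]
    p_stop = sum(c in stops for c in codons) / len(codons)
    return 1.0 - (1.0 - p_stop) ** n_codons


print(f"{stop_fraction(36):.2f}")  # 1 - (47/48)**36, i.e. about one half
```

The observed frequency (about 2 in 200 isolates) is therefore far below the roughly one-in-two expectation, which is the basis for the inference that TAG-bearing phage were lost from the library.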

The amino acids encoded by the inserted double stranded
synthesized oligonucleotide sequences, excluding the fixed PG-
encoding centers, were concatenated into a single sequence and
the usage frequency determined for each amino acid using the
Microsoft EXCEL program. These frequencies were compared to that
expected from the assembly scheme of the oligonucleotides, and
the divergence from expected values represented by the size of
the bars above and below the baseline. Chi square analysis was
used to determine the significance of the deviations. The
majority of amino acids were found to occur at the expected
frequency, with the notable exceptions that glutamine and
tryptophan were somewhat over- and under-represented,
respectively. Thus, except for the invariant Pro-Gly, any
position could have any amino acid; hence, the sequences are
unpredicted or random.
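
The comparison described above can be sketched as follows: enumerate the codons permitted by the randomization scheme, derive the expected frequency of each amino acid, and compute a chi-square statistic against observed counts. This is a minimal Python illustration and not the spreadsheet analysis actually used. It assumes NNB codons (B = C, G or T) and the standard genetic code; the function names are hypothetical, and any observed counts supplied to chi_square would have to come from real sequencing data.

```python
from collections import Counter
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def expected_freqs(third_base="CGT"):
    """Expected residue frequencies for N-N-(third_base) codons ('*' = stop)."""
    counts = Counter(
        CODE[a + b + c] for a, b, c in product("ACGT", "ACGT", third_base)
    )
    total = sum(counts.values())
    return {aa: n / total for aa, n in counts.items()}


def chi_square(observed, expected):
    """Pearson chi-square of observed residue counts against expected frequencies."""
    n = sum(observed.values())
    return sum(
        (observed.get(aa, 0) - n * p) ** 2 / (n * p) for aa, p in expected.items()
    )


freqs = expected_freqs()
for aa in sorted(freqs, key=freqs.get, reverse=True):
    print(aa, f"{freqs[aa]:.3f}")
```

Under the NNB assumption glutamine and tryptophan are each encoded by a single codon (CAG and TGG), so both are expected at 1/48; a deviation in opposite directions for these two residues, as reported above, therefore cannot be explained by codon degeneracy alone.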

6.3.2. Characterization of TSAR-12 Library

Approximately 10 randomly chosen inserted 
oligonucleotides from the TSAR- 12 library were examined by DNA 
sequencing as described above in Section 6.3.1. The isolates 
were chosen at random from the TSAR-12 library and prepared for 
sequencing, as were the TSAR-9 isolates. Analysis showed that 
except for the invariant Gly any position could have any amino 
acid; hence, the sequences are unpredicted or random. 

6.4. Preparation of R8C Library 

Referring now to Figure 3, two oligonucleotides were
synthesized on an Applied Biosystems Model 380a machine with the
sequence 5'-
TGACGTCTCGAGTTGTNNKNNKNNKNNI<1INKNN^ '
(SEQ ID NO:31) and 5'-GATCCTTCTAGATCC-3' (SEQ ID NO:32), where N
is an equimolar ratio of deoxynucleotides A, C, G, and T, and K
is an equimolar ratio of G and T. Fifty pmol of each
oligonucleotide was incubated at 42 °C for 5 min, then 37 °C for
15 min, in 50 µL of Sequenase™ buffer (U.S. Biochemicals,
Cleveland, OH) with 0.1 µg/µL acetylated BSA, and 10 mM DTT.
After annealing, 10 units of Sequenase™ (U.S. Biochemicals) and
0.2 mM of each dNTP were added and incubated at 37 °C for 15 min.
The sample was then heated at 65 °C for 2 hr, digested with 100
units of both Xho I and Xba I (New England BioLabs, Beverly, MA),
phenol extracted, ethanol precipitated, and resolved on a 15%
non-denaturing polyacrylamide gel. The assembled, digested
fragment was gel purified prior to ligation. The vector, m663
(Fowlkes, D. et al. BioTech. (1992) 13:422-427), was prepared by
digestion with Xho I and Xba I, calf alkaline phosphatase
(Boehringer Mannheim, Indianapolis, IN) treatment, phenol
extracted, and purified by agarose gel electrophoresis. To
ligate, 20 µg vector was combined with 0.2 µg insert in 3 mL with
T4 DNA ligase (Boehringer Mannheim), according to the
manufacturer. After removal of the protein and buffer by phenol
extraction and ethanol precipitation, the ligated DNA was
electroporated into XL1-Blue E. coli (Stratagene, San Diego, CA)
and plated for eight hours at 37 °C. To recover the recombinant
phage, the top agar was collected with a spatula, mixed with an
equal volume of 100 mM NaCl, 10 mM MgCl2, and 50 mM Tris-HCl
(pH 7.5), and disrupted by two passes through an 18-gauge syringe
needle. The bacterial cells were removed by centrifugation, and
phage particles were collected by polyethylene glycol
precipitation and stored at -70 °C in 25% glycerol. The library
had 10^8 total recombinants and a working titer of 6 x 10^^ pfu/mL.

Members of the library were checked for inserts by the
polymerase chain reaction (Saiki, et al. Science (1988) 239:487-
491). Individual plaques on a petri plate were touched with a
sterile toothpick and the tip was stirred into 2xYT with F' E.
coli bacteria and incubated overnight at 37 °C with aeration.
Five microliters of the phage supernatant were then transferred
to new tubes containing buffer (67 mM Tris-HCl, pH 8.8/10 mM β-
mercaptoethanol/16.6 mM ammonium sulfate/6.7 mM EDTA/50 µg bovine
serum albumin per mL), 0.1 mM deoxynucleotide triphosphates, and
1.25 units of Taq DNA polymerase (Boehringer Mannheim,
Indianapolis, IN) with 100 pmoles of oligonucleotide primers.
The primers flanked the cloning site in gene III of m663 (5'-
TTCACCTCGAAAGCAAGCTG-3' (SEQ ID NO: 100) and
5'-CCTCATAGTTAGCGTAACG-3' (SEQ ID NO:101)). The assembly
reactions were incubated at 94 °C for 1 min, 56 °C for 2 min, and
72 °C for 3 min; this cycle was repeated 24 times. The reaction
products were then resolved by electrophoresis on a NuSieve 2.0%
agarose gel (FMC, Rockland, ME). Gels revealed that for 20
plaques tested, all were recombinant and had single inserts of
the expected size.

Based on the sample size of the library, it was anticipated
that 100% of the recombinants had single inserts. However, all
of the SH3-binding phage isolated from the R8C library had
double-inserts. Such phage are presumed rare (i.e., <5%) within
the library, yet because the SH3-binding peptide appears to need
to be linear they were selected for by our screening methods.
Most likely they were formed during the generation of the
library; one scenario is that the inserts ligated together to
form head-to-head dimers and that they were subsequently cloned
into m663 DNA by ligation with the vector's Xho I sticky end and
by illegitimate ligation with the vector's Xba I site (see
Figure 4).

6.5. Preparation Of Target-Coated Microtiter Wells

6.5.1. Preparation Of GST-SH3 Fusion Proteins

The preparation of Src-GST fusion protein was
first described by Smith and Johnson, in Gene (1988) 67:31, the
disclosure of which is incorporated by reference herein.
Briefly, pGEX-derived (Pharmacia, Piscataway, NJ) constructs
expressing GST fusion proteins containing the SH3 domains of Src,
Grb2, Crk, Abl, or PLCγ were obtained from Dr. Channing Der
(University of North Carolina at Chapel Hill); a construct
expressing the SH3 domain of Yes was obtained from Dr. Marius
Sudol (Rockefeller University). The use of the pGEX bacterial
expression vector for the production of GST-SH3 fusion proteins
is well-known to those in the art. See, e.g., Cicchetti, P. et
al., in Science (1992) 257:803-806. Briefly, the coding region
for a particular SH3 domain can be fused in-frame at the Bam HI
site of pGEX-2T. Thus, fusion proteins were prepared as per the
manufacturer's instructions, and quantified by Coomassie Blue
staining of SDS-polyacrylamide gels. Microtiter wells were
coated with 5-20 µg GST-SH3 fusion protein in 100 mM NaHCO3, pH
8.5, blocked with 100 mM NaHCO3 (pH 8.5) 1% BSA, and washed. All
washes consisted of five applications of 1X PBS, 0.1% Tween 20,
0.1% BSA (Buffer A). Where appropriate, the amount of protein
bound to each well was quantified with an anti-GST antibody-based
ELISA (Pharmacia, Piscataway, NJ), and with a GST-binding phage,
isolated during the course of this work.

6.5.2. Coating of Microtiter Wells

Bacterially expressed Src SH3 glutathione-S-
transferase (Src-GST) fusion protein was purified from bacterial
lysates using glutathione agarose 4B (Pharmacia), according to
the manufacturer's instructions. Bound Src-GST fusion protein
was eluted from the glutathione agarose with 10 mM glutathione in
PBS. Microtiter wells were then coated with Src-GST fusion
protein (1-10 µg/well, in 50 mM NaHCO3, pH 8.5) overnight at 4
°C. To block non-specific binding of phage, 100 µL 1% BSA in 100
mM NaHCO3, pH 8.5, was added to each well and allowed to incubate
at room temperature for 1 hour. The wells were then washed five
times with 200 µL PBS, 0.1% Tween 20, 0.1% BSA (Buffer A).

6.6. Biopanning And Subsequent Characterization Of
Phage-Displayed Random Peptide Libraries
With Src-GST Fusion Protein As Target Molecule

6.6.1. Isolation of Src SH3-Binding Phage

Library screens were performed as previously
described. Kay, B.K., et al., in Gene (1993) 128:59-65.
Briefly, 1 x 10^^ pfu TSAR-9, TSAR-12, or R8C phage in Buffer A
were incubated in a Src SH3-GST-coated well for 2 hours. The
wells were washed, and bound phage were eluted with 100 µL 50 mM
glycine-HCl (pH 2.2), transferred to a new well, and neutralized
with 100 µL 200 mM NaHPO4 (pH 7.0). Recovered phage were used to
infect 1 x 10' DH5αF' E. coli cells in 20 mL 2xYT; the infected
cells were grown overnight, resulting in a 1000- to 10,000-fold
amplification of phage titer. Amplified phage were panned twice
more, as above, excepting the amplification step. Binding phage
recovered after the third round of panning were plated at a low
density on a lawn of DH5αF' E. coli cells to yield isolated
plaques for clonal analysis. Isolated plaques were used to
produce small cultures from which phage stocks and DNA were
recovered for phage binding experiments and dideoxy sequencing
(Sanger, F., et al., in Proc. Natl. Acad. Sci. USA (1977)
74:5463-5467), respectively. Clones were confirmed as binding
the SH3 domain by applying equal titers of phage to wells
containing Src SH3-GST or GST alone, and titering the number of
eluted particles from each well, or detecting bound phage with an
anti-phage antibody-based ELISA (Pharmacia).

Indeed, the ability of isolated phage clones to bind to
several SH3 domains derived from a variety of different proteins
can be investigated in the manner described above. GST-SH3
fusion proteins containing SH3 domains from a variety of
different proteins are bound to microtiter wells. An aliquot of
the aforementioned phage stocks (50 µL) is introduced into wells
containing the different GST-SH3 fusion proteins. After room
temperature incubation for 1-2 hours, the liquid contents of the
microtiter plates are removed, and the wells are washed 5 times
with 200 µL Buffer A. Bound phage are eluted with 100 µL 50 mM
glycine (pH 2.2), transferred to a new well, and neutralized with
100 µL 200 mM NaHPO4 (pH 7.0). The phage are diluted 10"^- to
10'^-fold, and aliquots are plated onto lawns of DH5αF' E. coli
cells to establish the number of plaque forming units in the
output sample. From these experiments, the relative specificity
of different Src SH3 binding clones for SH3 domains derived from
other proteins is determined.
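
The titering arithmetic behind this comparison can be sketched in a few lines of Python. This is an illustration only: the function names, plaque counts, dilutions, and plated volumes below are hypothetical and are not taken from the experiments described herein.

```python
def titer_pfu_per_ml(plaques: int, dilution: float, plated_ml: float) -> float:
    """Plaque-forming units per mL of the undiluted eluate.

    dilution is the fraction of the original sample left after serial
    dilution (e.g. 1e-6); plated_ml is the volume spread on the lawn.
    """
    if plaques < 0 or dilution <= 0 or plated_ml <= 0:
        raise ValueError("plaques must be >= 0; dilution and volume must be > 0")
    return plaques / (dilution * plated_ml)


def relative_recovery(target_titer: float, control_titer: float) -> float:
    """Ratio of phage recovered from a GST-SH3 well to a GST-only well."""
    if control_titer <= 0:
        raise ValueError("control titer must be > 0")
    return target_titer / control_titer


# Hypothetical counts for one clone applied at equal titer to two wells.
sh3_well = titer_pfu_per_ml(plaques=120, dilution=1e-4, plated_ml=0.1)
gst_well = titer_pfu_per_ml(plaques=6, dilution=1e-2, plated_ml=0.1)
print(f"recovery ratio (SH3 well / GST well): {relative_recovery(sh3_well, gst_well):.0f}")
```

Repeating the calculation for wells coated with different GST-SH3 fusion proteins gives one ratio per SH3 domain, which is one simple way to tabulate the relative specificity of a clone.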

6.6.2. Phage ELISA and Nucleotide Sequencing

To evaluate the binding of isolates to various target proteins,
enzyme-linked immunoassays (ELISA) were also performed.
Bacterial cultures were infected with phage isolates and cultured
overnight in 2XYT at 37 °C. The cells were spun down and 25 µL
of supernatant was added to microtiter plate wells coated with
50 µL of protein (1 mg/mL in 100 mM NaHCO3, pH 8.4; overnight at
4 °C or for a few hours at room temperature) and blocked
(1 mg/mL BSA in 100 mM NaHCO3, pH 8.4; for about one hour). The
phage are incubated in the well with 25 µL of PBS-0.1% Tween 20
at RT for 2 hr. The wells are then washed multiple times over 30
minutes. To each well is added 50 µL of polyclonal anti-phage
antibody conjugated to horseradish peroxidase. The antibody is
diluted 1:3000 in BSA-PBS-Tween 20; it was obtained from
Pharmacia (Piscataway, NJ; catalog number 27-9402-01). After 30
minutes, the wells are washed again with BSA-PBS-Tween 20 for
~20 minutes. Finally, 100 µL of ABTS reagent (Pharmacia, with
H2O2) are added to each well for the development of color.
Plates are read with a plate reader (Molecular Devices, Menlo
Park, CA) at 405 nm wavelength.

The relevant segments of the Src SH3 binding clones (or phage
clones that bind to SH3 domains of other proteins) were sequenced
using standard methods. Sanger, F., et al., in Proc. Natl. Acad.
Sci. USA (1977) 74:5463-5467. The oligo primer
5'-AGCGTAACGATCTAAA-3' (SEQ ID NO:102) was used, which is 89
nucleotides downstream of the gene III cloning site of M13 m666.
The nucleotide sequences were analyzed with the MacVector
computer program (IBI, New Haven, CT, USA). From this nucleotide
sequence information the primary sequence of each Src SH3 binding
peptide was deduced. The corresponding synthetic peptides were
then prepared by techniques well known in the art with or without
flanking sequences. Indeed, these synthetic peptides have been
shown to bind to SH3 domain targets, with those possessing the
phage flanking amino acid residues exhibiting greater binding
affinity.
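
The step from nucleotide sequence to deduced peptide can be sketched as follows. This is a minimal Python illustration, not the MacVector analysis itself. It assumes that a read primed downstream of the cloning site reports the non-coding strand and must be reverse-complemented first, and the example read is hypothetical: it was constructed to encode the RPLPPLP consensus and is not a sequence from any isolated clone.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code in TCAG order; "*" marks a stop codon.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the opposite strand, written 5' to 3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate complete codons in frame 1 (raises KeyError on ambiguous bases)."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Hypothetical read of the non-coding strand across a peptide insert.
read = "CGGCAGCGGCGGCAGCGGACG"
print(translate(reverse_complement(read)))
```

In practice the reading frame would be fixed by the known flanking gene III sequence rather than assumed to start at the first base of the read.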

6.7 In Vitro Peptide Binding Assays 

Peptides were obtained from Research Genetics (Birmingham, AL),
Chiron Mimotopes (Victoria, Australia), or synthesized by
conventional techniques by Dr. J. Mark Carter of Cytogen
Corporation (Princeton, NJ). Peptide purity was assessed by HPLC
and/or mass spectrometry. Biotinylated peptides were synthesized
with either a KSGSG (SEQ ID NO: 103) or a GSGS (SEQ ID NO: 104)
peptide linker (a spacer) between the biotin and the N-terminus
of the peptide. Binding experiments were performed as above,
excepting the use of 10 peptide instead of phage. Bound
biotinylated peptide was detected with streptavidin conjugated to
alkaline phosphatase (Sigma Chemical Co., St. Louis, MO). After
a one hour incubation period at room temperature, the wells were
washed, and a solution of 3 mM p-nitrophenyl-phosphate (US
Biochemicals, Cleveland, OH) in 50 mM NaCO3 (pH 9.8), and 50 mM
MgCl2 was added and color allowed to develop. Signals were read
with an ELISA plate reader (Molecular Devices, Menlo Park, CA) at
405 nm wavelength. Binding experiments were performed in
triplicate. The results are presented in Figures 7 and 8.
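
Triplicate absorbance readings of this kind are commonly summarized as a mean and a sample standard deviation, with a control well subtracted. The Python sketch below shows one such summary; the readings are hypothetical values chosen for illustration, not data from Figures 7 and 8, and the choice of a GST-only well as the background is an assumption.

```python
from statistics import mean, stdev


def summarize_triplicate(a405):
    """Return (mean, sample standard deviation) of replicate A405 readings."""
    if len(a405) < 2:
        raise ValueError("need at least two replicates for a standard deviation")
    return mean(a405), stdev(a405)


def background_subtracted(signal, background):
    """Mean signal minus mean background (e.g. a GST-only control well)."""
    return mean(signal) - mean(background)


# Hypothetical readings for one biotinylated peptide.
src_sh3_wells = [0.80, 0.90, 1.00]
gst_only_wells = [0.05, 0.06, 0.07]

avg, sd = summarize_triplicate(src_sh3_wells)
net = background_subtracted(src_sh3_wells, gst_only_wells)
print(f"A405 = {avg:.2f} +/- {sd:.2f}; net of background = {net:.2f}")
```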

6.8. Peptide Competition of GST-SH3 Affinity
Precipitations of Cell Lysates

Labeled proteins are prepared by incubating a culture of HeLa
cells overnight with ≥100 µCi/mL 35S-methionine. The cells are
then washed and lysed with mild detergent. This mixture of
radioactive proteins is incubated with Src-GST fusion protein
that has been immobilized on glutathione-linked Sepharose beads
(Pharmacia, Piscataway, NJ). After several hours of tumbling,
the beads are pelleted gently by low-speed centrifugation, and
the supernatant is discarded. The beads are then resuspended
into a slurry in PBS-0.1% Tween 20, pelleted, and washed several
additional times. Finally, a 2% SDS solution is added to the
sample, which is then boiled at 100 °C for 3 minutes. Afterward,
the sample is centrifuged, and the supernatant loaded on a 10%
polyacrylamide SDS gel for electrophoresis. After the proteins
have been resolved, the gel is fixed, dried down, and exposed to
X-ray film for autoradiography or phosphor plates for scanning by
a Molecular Dynamics PhosphorImager.

The ability of Src SH3 to bind certain 35S-labeled proteins
is examined for competability with exogenous peptides. Synthetic
peptides corresponding to phage-displayed inserts and motifs are
added at the time that the lysate is incubated with the Src-GST
fusion protein immobilized on glutathione-linked Sepharose beads.
The SH3 binding peptides block binding of all or some of the
labeled proteins while negative control peptides (unrelated
peptide sequences) do not. The amount of competition is
quantified and correlated with the amount of added SH3-domain
binding peptides.

Alternatively, NIH 3T3 cells were grown in Dulbecco's
Modified Eagle Medium (DME) + 10% fetal calf serum (FCS) + 80
µCi/mL Tran35S-label (ICN), washed with PBS, lysed in RIPA
buffer, and pelleted. Supernatant from 1.5 x 10* cells was
precleared with 100 µg glutathione-agarose-immobilized GST. The
supernatant was then incubated with 10 glutathione-agarose-
immobilized GST-SH3 fusion protein with or without added test
peptide in a final volume of 250 µL. Pelleted beads were washed
with 1 mL each of RIPA, RIPA + 1% deoxycholate + 0.1% SDS, and
PBS, resuspended in 50 µL SDS-PAGE sample buffer, boiled, and
subjected to SDS-PAGE (7.5%). Labeled proteins were detected by
phosphorimaging (Molecular Dynamics). The results are presented
in Figure 9.

6.9. Peptide Competition of GST-SH3 Affinity Precipitations
of PI-3' Kinase From Cell Lysates

It is possible to follow the precipitation of PI-3' Kinase by
Src from cell lysates in the presence or absence of SH3-binding
peptides. HeLa cells are lysed with detergent and the protein
mixtures are incubated for several hours with the Src-GST fusion
protein immobilized on glutathione-linked Sepharose beads. After
several hours of tumbling, the beads are pelleted gently by
low-speed centrifugation and the supernatant is discarded. The
beads are then resuspended into a slurry in PBS-0.1% Tween 20,
pelleted, and washed several additional times. Finally, an SDS
solution is added to the sample, which is then boiled at 100 °C
for 3 minutes. Subsequently, the sample is centrifuged, and the
supernatant is loaded on a 10% polyacrylamide SDS gel for
electrophoresis. After the proteins have been resolved, the gel
is blotted to nitrocellulose or nylon (i.e., western blot). The
filter is then probed with a PI-3' Kinase antibody (monoclonal
and polyclonal antibodies are available from Upstate
Biotechnology Incorporated, Lake Placid, NY) and an enzyme-linked
secondary antibody. The amount of PI-3' Kinase is then
quantitated.

The ability of Src SH3 to bind PI-3' Kinase is examined for
competability with exogenous peptides. Synthetic peptides
corresponding to phage-displayed inserts and motifs are added at
the time that the lysate is incubated with the Src-GST fusion
protein that has been immobilized on glutathione-linked Sepharose
beads. Ten-fold and one hundred-fold molar excess of peptides
are used relative to SH3 proteins. The SH3 binding peptides
block binding of the PI-3' Kinase as detected on western blots
while negative control peptides (unrelated peptide sequences) do
not. The amount of competition is quantified and correlated with
the amount of added SH3-domain binding peptides.
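
The amount of peptide that corresponds to a given molar excess follows directly from the molecular weights. The Python sketch below shows the conversion; the protein mass, the 40 kDa fusion protein, and the 1500 Da peptide are hypothetical round numbers used only for illustration and are not values given in this specification.

```python
def peptide_for_molar_excess(protein_ug, protein_kda, fold_excess, peptide_da):
    """Return (nmol, micrograms) of peptide for a given molar excess.

    Micrograms divided by kDa gives nanomoles; nanomoles times Da gives
    nanograms, hence the division by 1000 to report micrograms.
    """
    if min(protein_ug, protein_kda, fold_excess, peptide_da) <= 0:
        raise ValueError("all inputs must be positive")
    protein_nmol = protein_ug / protein_kda
    peptide_nmol = fold_excess * protein_nmol
    peptide_ug = peptide_nmol * peptide_da / 1000.0
    return peptide_nmol, peptide_ug


# Hypothetical: 10 ug of a 40 kDa GST-SH3 fusion and a 1500 Da peptide.
for fold in (10, 100):
    nmol, ug = peptide_for_molar_excess(10.0, 40.0, fold, 1500.0)
    print(f"{fold}-fold excess: {nmol:.1f} nmol peptide ({ug:.2f} ug)")
```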

6.10. In Vivo Association Of SH3-Binding Peptides
With SH3-Domains Of Proteins

To demonstrate association of the SH3-binding peptides with
SH3-domains of proteins inside cells, the SH3-binding peptides
are tagged and localized in cells. For example, Bar-Sagi et al.,
in Cell (1993) 74:83-91, have shown that SH3-binding proteins
localize to the cytoskeleton when expressed in cells. Thus, the
SH3 domain-binding peptides of the present invention can serve as
cellular targeting signals (e.g., to the cytoskeleton).
Accordingly, the peptides are tagged with biotin and,
subsequently, injected into cells. Alternatively, one can
transfect into cells a recombinant plasmid that expresses a
fusion protein comprising the SH3-binding peptide and the green
fluorescent protein (GFP, Chalfie et al., in Science (1994)
263:802-805). The location of the biotinylated peptide or the
GFP fusion protein is then assayed with FITC-labeled streptavidin
in paraformaldehyde-fixed cells or by direct fluorescence in
living cells, respectively. Localization of the SH3-binding
peptides to the cytoskeleton demonstrates that the SH3-binding
peptides can bind SH3-domain proteins in vivo. In addition,
focal adhesions, which are rich in Src, are also sites of
potential subcellular localization of SH3-binding peptides.

Thus, NIH 3T3 fibroblasts were cultured in vitro on glass
coverslips coated with fibronectin. After two days of growth at
37 °C, the cells were fixed for one hour at room temperature in
the presence of 2% paraformaldehyde (pH 7.5). The coverslips
were washed with PBS-0.1% Tween 20 several times to remove the
fixative. Next, the coverslips were dipped into acetone (chilled
at -20 °C) for approximately 20 seconds and allowed to air-dry.
The coverslips were washed again with PBS-0.1% Tween 20,
containing BSA (1 mg/mL), and incubated for 2 hours at room
temperature with different biotinylated peptides in PBS-0.1%
Tween 20. The coverslips were washed and then incubated with 1
mg/mL streptavidin-Cy3 (Jackson Immunoresearch Co., West Grove,
PA) for 1 hour at room temperature. Finally, the coverslips were
washed in PBS-0.1% Tween 20, mounted in a glycerol solution on a
glass slide, and viewed with a Nikon Optiphot epifluorescence
microscope and a 60x oil immersion lens.

The results are presented in Figure 11, in which panel A
displays cells stained with the conjugate biotin-spacer-
VLKRPLPIPPVTR (SEQ ID NO:64); panel B exhibits cells stained with
the conjugate biotin-spacer-GILAPPVPPRNTR (SEQ ID NO:63); panel
C shows cells stained with the long consensus peptide, biotin-
spacer-RSTPRPLPPLPTTR (SEQ ID NO:67); and panel D shows cells
stained with the proline-rich vinculin peptide conjugate, biotin-
spacer-LAPPKPPLPEGEV (SEQ ID NO:70). The "spacer" sequence is
KSGSG (SEQ ID NO:103). As shown in Figure 11, the panels in
which SH3 domain-binding peptides were used present a bright
display of fluorescence activity that is in sharp contrast to the
relatively "dark" features of panel D (non-SH3 domain binding
vinculin segment). These results demonstrate further the ability
of the SH3 domain-binding peptides of the present invention to
localize to protein targets (e.g., Src and Src-related proteins)
within cells and provide an image thereof.

6.11. In Vivo Modulation Of Src In Oocytes With
SH3-Binding Peptides

When Xenopus laevis oocytes are injected with mRNA encoding
deregulated Src, there are dramatic cytological and biochemical
changes in the oocyte (Unger, T.F. and Steele, R.E., in Mol.
Cell. Biol. (1992) 12:5485-5498). The applicants have obtained
the plasmid for generating Src mRNA, which is available from Dr.
Robert Steele (University of California at Irvine). Synthetic
SH3-binding peptides are injected into oocytes that have been
previously injected with Src mRNA. The state of the cytoskeleton
is inspected visually by observing the arrangement of cortical
pigment granules under a dissecting microscope. The state of
phosphorylation of several proteins is examined by western
blotting with an anti-phosphotyrosine monoclonal antibody (4G10;
Upstate Biotechnology Incorporated), as described in Unger and
Steele, above.

6.12. Progesterone-induced X. laevis Oocyte
Maturation

Segments of adult ovary were removed surgically and incubated
in 0.1% collagenase type D (Boehringer Mannheim, Indianapolis,
IN) in Ca2+-free OR2 (82.5 mM NaCl, 2.5 mM KCl, 1.0 mM MgCl2,
1.0 mM Na2HPO4, 5.0 mM HEPES, and 3.8 mM NaOH, pH 7.6). Oocytes
were then washed 3-5 times with OR2 containing 1.0 mM CaCl2 and
allowed to recover in OR2 overnight at 18 °C. Stage VI oocytes
were injected with 40 nL of 100 mM peptide or water. After
injection, the oocytes were placed in OR2 with 2 mg/mL
progesterone (Sigma, St. Louis, MO) and incubated at 20 °C.
Oocytes were scored at hourly time points for germinal vesicle
breakdown (GVBD).

Figure 10 presents the results of this experiment. As shown
by the graph, oocytes injected with the SH3 domain-binding
peptide VLKRPLPIPPVTR (SEQ ID NO: 64) exhibit a faster rate of
progesterone-induced germinal vesicle breakdown relative to
oocytes that had been injected with water or with the proline-
rich vinculin peptide, LAPPKPPLPEGEV (SEQ ID NO: 70). These
results parallel those obtained by Unger and Steele, supra, who
observed a gross alteration in the cortex of oocytes that had
been injected with RNA encoding a deregulated Src variant
("active" Src) versus those injected with RNA encoding the wild-
type Src ("cellular" Src). Also, as shown in Figure 3B of the
Unger and Steele article, oocytes injected with deregulated or
active Src RNA matured at a faster rate than oocytes injected
with water or wild-type Src RNA.

The present results obtained with Src SH3 domain-binding 
peptides suggest that these peptides modulate the biochemical 
activity of "cellular" Src; in particular, it is proposed that at 
least some of the Src SH3 domain-binding peptides of the present 
invention upregulate the biochemical activity of "cellular" Src, 
which may be downregulated or inhibited in its normal state. 
Hence, the administration of the SH3 domain-binding peptides of 
the present invention can constitute a novel method of modulating 
the activity of Src or Src-related proteins. Specifically, 
certain of these peptides are able to activate Src-family
proteins.

6.13. In Vivo Antagonism Of Src In Src Transformed
Cells With SH3-Binding Peptides

The coding regions for SH3-binding peptides are
cloned into vectors that direct their expression in animal cells.
A bipartite gene is constructed, encoding a protein with c-myc
epitope and SH3-binding peptide, which is transcribed from a
strong constitutive promoter (e.g., SV40, CMV, HSV TK,
calmodulin). The vector is introduced into either normal or Src-
transformed cells via transfection (e.g., electroporation,
calcium phosphate, liposomes, DEAE dextran). Transfected cells
express the bipartite gene transiently in culture. To create
stable transformed cell lines, the vector carries a selectable
marker (e.g., neomycin resistance) or transfection is performed
in the presence of excess plasmid carrying a selectable marker
(e.g., neomycin resistance) and cells selected for the marker.
Transfected cells are stained by immunofluorescence to detect
expression of the bipartite protein. The hybridoma 9E10 secretes
a monoclonal antibody that is highly specific for the c-myc
epitope (EQKLISEEDLN [SEQ ID NO:105]; see Evan, G.A. et al., in
Mol. Cell. Biol. (1985) 5:3610-3616). This antibody is used in
immunofluorescence experiments to demonstrate that the bipartite
protein is expressed inside the cells, and in some cases,
localized to subcellular structures enriched in SH3 domain
bearing proteins.

There are several controls used in these experiments.
First, cells are transfected with vectors that do not have the
SH3-binding peptide coding region. Second, normal (non-
transformed) cells are transfected. Third, cells transformed by
oncogenes other than Src are used in the transfection
experiments. Fourth, cells are stained with other monoclonal
antibodies that do not recognize the c-myc epitope.

Transfected cells are examined for any changes in cell
shape, behavior, and metabolism as a consequence of expressing
the SH3 binding peptides. Cell shape is examined by phase
contrast microscope at several times after transfection; in
particular, the flatness of the cells, their adhesion to the
substrate, and the degree of cell ruffling are monitored. Cell
division rates, cell migration, and contact inhibition are also
observed over time. Finally, the amount of phosphorylated
tyrosine in transfected cells is quantitated by phosphoaminoacid
analysis and with an anti-phosphotyrosine monoclonal antibody
(4G10; Upstate Biotechnology Incorporated) in western blotting
experiments.

6.14 Preparation of PXXP Biased Peptide Libraries
The preparation and characterization of preferred phage-
displayed random peptide libraries have been described above in
Sections 6.1 - 6.4.

Using procedures similar to those described in these
sections, oligonucleotide inserts were constructed according to
the schematic provided in FIG. 12. The inserts were then cloned
into the mBAX vector, and the biased peptide libraries were
expressed as described previously.

The mBAX vector was created in the Kay Laboratory by
generating cloning sites in gene III of the M13mp18 vector
(Messing, J. (1991). Cloning in M13 phage or how to use biology
at its best. Gene 100, 3-12) in the manner of Fowlkes et al.
(1992) (BioTechniques 13, 422-427). The mBAX vector displays a
peptide sequence at the N-terminus of the mature pIII protein
that contains the epitope for the mouse monoclonal antibody 7E11
(see FIG. 13); it includes the stop codon TAG in the coding
region, which is suppressed in E. coli carrying suppressor tRNA
gene mutations known as supE or supF. There are no other stop
codons in the mBAX genome. The mBAX vector also carries a
segment of the alpha fragment of β-galactosidase; bacterial cells
expressing the omega fragment of β-galactosidase convert the
clear XGal substrate into an insoluble blue precipitate, so that
the plaques appear blue.

Recombinant mBAX molecules can be distinguished easily from
non-recombinant molecules due to the TAG codon in the Xho I -
Xba I segment in gene III of mBAX. When recombinants are
generated by replacing the Xho I - Xba I fragment with
oligonucleotides encoding random peptides, the recombinants can
be grown in bacteria with (i.e., DH5αF') or without (i.e., JS5)
suppressor tRNA mutant genes. On the other hand, the non-
recombinant mBAX molecules fail to produce plaques on bacterial
lawns where the bacteria (i.e., JS5) lack such suppressor genes.
This is because in JS5, the TAG codon serves as a stop codon to
yield a truncated pIII molecule during translation; since pIII is
an essential protein component of viable M13 viral particles, no
plaques will form.
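
The selection logic described above can be stated as a small decision rule. The Python sketch below is illustrative only: the function names are hypothetical, the input is assumed to be an in-frame fragment of the gene III coding region lying upstream of the natural stop codon, and the short example sequences are invented rather than taken from mBAX.

```python
def in_frame_codons(coding: str) -> list:
    """Split a coding-strand fragment into complete codons, frame 1."""
    coding = coding.upper()
    usable = len(coding) - len(coding) % 3
    return [coding[i:i + 3] for i in range(0, usable, 3)]


def plaques_expected(gene_iii_fragment: str, host_suppresses_amber: bool) -> bool:
    """Predict whether full-length pIII, and hence plaques, can be produced.

    An in-frame TAG (amber) codon is read through only in a suppressor
    host (supE or supF); TAA and TGA are treated as stops in any host.
    """
    for codon in in_frame_codons(gene_iii_fragment):
        if codon in ("TAA", "TGA"):
            return False
        if codon == "TAG" and not host_suppresses_amber:
            return False
    return True


# Invented fragments: one with an in-frame TAG, one with the TAG replaced.
non_recombinant = "TCGAGTTAGGCTTCT"
recombinant = "TCGAGTCCGGCTTCT"

for name, seq in (("non-recombinant", non_recombinant), ("recombinant", recombinant)):
    for host, suppresses in (("suppressor host", True), ("non-suppressor host", False)):
        print(f"{name} on {host}: plaques expected = {plaques_expected(seq, suppresses)}")
```

Only the combination of a non-recombinant vector and a non-suppressor host returns False, which is the basis for plating the library on a strain such as JS5 to eliminate vector background.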

6.14.1 Discussion

The use of second generation or biased peptide libraries, 
which fix all or part of the RPLPPLP consensus motif for SH3 
binding peptides and randomize flanking residues, has defined 
additional sequence residues exhibiting selective SH3 binding. 

Tables 1-5, below, list the relevant amino acid sequences 
obtained from the biased peptide library for each set of SH3 
domain binding peptides. The underscored amino acid residues 
indicate the fixed positions. Also indicated for each set of
new binders is a "consensus" sequence, which seeks to include the
additional features gleaned from the new binding peptides. The
symbol "0" represents a hydrophobic residue. 
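
A consensus written in this notation can be checked against a peptide mechanically. The Python sketch below is a minimal illustration, assuming that "x" stands for any residue and "0" for a hydrophobic residue; the membership of the hydrophobic set (which here includes Pro, as the sequence listing notes allow) is an assumption, and the pattern RxLP00P is a shorthand for the first seven positions of SEQ ID NO: 10 rather than a consensus stated in the tables. The peptides are those named in the description of Figure 11.

```python
import re

# Assumed hydrophobic set; Pro is included per the sequence listing notes.
HYDROPHOBIC = "AVLIMFWYP"


def consensus_to_regex(consensus: str) -> "re.Pattern":
    """Compile a consensus string: 'x' = any residue, '0' = hydrophobic."""
    parts = []
    for symbol in consensus:
        if symbol == "x":
            parts.append(".")
        elif symbol == "0":
            parts.append(f"[{HYDROPHOBIC}]")
        else:
            parts.append(re.escape(symbol))
    return re.compile("".join(parts))


def find_motif(peptide: str, consensus: str):
    """Return the 0-based start of the first match, or None if absent."""
    match = consensus_to_regex(consensus).search(peptide)
    return match.start() if match else None


peptides = {
    "SEQ ID NO:64": "VLKRPLPIPPVTR",
    "SEQ ID NO:67": "RSTPRPLPPLPTTR",
    "SEQ ID NO:70 (vinculin)": "LAPPKPPLPEGEV",
}
for label, sequence in peptides.items():
    print(label, find_motif(sequence, "RxLP00P"))
```

A pattern match of this kind only flags candidate sequences; binding itself is established by the phage and peptide assays described above.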



TABLE 1  CORTACTIN SH3-BINDING PEPTIDES

                                                  SEQ. ID NO.
PXXP.CORT.M1/2/3.PP     SSIiLGPPVPPKPQTLFSFSR     107
PXXP.CORT.M4.PP         SRLGEFSKPPIPQKPTWMSR      108
PXXP.CORT.N2.PP         SRTERPPLPQRPDWLSYSSR      109
PXXP.CORT.N3.PP.INC     SREPDVJLCPNCPLLLRSDSR     110
PXXP.CORT.O1/2/3.PP     SSSSHNSRPPLPEKPSWLSR      111
PXXP.CORT.O4.PP         SRLTPQSKPPLPPKPSAVSR      112
CONSENSUS               KPPi^PxKPxW               113
                        R


TABLE 2  NCK SH3-BINDING PEPTIDES

                                                  SEQ. ID NO.
PXXP.NCK.Q1/4.PP        SSLGVGWKPLPPMRTASLSS      114
PXXP.NCK.Q2/3.PP.INC    SSVGFADRPRPPLRVESLSR      115
PXXP.NCK.R1.PP.INC      SSAGILRPPEKPXRSFSLSE      116
PXXP.NCK.R2.PP          SSPYTGDVPIPPLRGASLSE      117
PXXP.NCK.R3.PP          SSLMGSWPPVPPLRSDSLSE      118
PXXP.NCK.R4.PP          SSIGEDTPPSPPTRRASLSE      119
PXXP.NCK.S1/4.PP        SESLSEVSPKPPIRSVSLSE      120
PXXP.NCK.S2.PP.INC      SSVSEGYSPPLPPRSTSLSR      121
PXXP.NCK.S3.PP          SSSFTLAAPTPPTRSLSLSS      122
PXXP.NCK.T1.PP          SSPPYELPPRPPNRTVSLSS      123
PXXP.NCK.T2.PP          SRWDGIAPPPPVRLSSI.SR      124
PXXP.NCK.T3.PP.INC      SSLGYSGAPVPPHRxSSLSE      125
PXXP.NCK.T4.PP          SSISDYSRPPPPVRTLSLSS      126
CONSENSUS               0xxxxx£xP£0RSxSL          127
                        T


TABLE 3  ABL SH3-BINDING PEPTIDES

                                                  SEQ. ID NO.
PXXP.ABL.G1/2.PP        SRGPRWSPPPVELPTSLDSS      128
PXXP.ABL.G3/4.PP        SSPPDYAAEAIPSSLWVDSR      129
PXXP.ABL.H1/3/4.PP      SSPPHWAPEAP£AMSPPISE      130
PXXP.ABL.H2.PP.INC      SSDRCWEC£PW£AGGQRGSR      131
PXXP.ABL.I1/2/3.PP      SSPPKFSP£PPPYWQLHASfi      132
PXXP.ABL.I4.PP          SSPPSFAPJEAAPPRHSFGSR     133
PXXP.ABL.J1.PP          SSAPKKPAPPVEMMAHVMSS      134
PXXP.ABL.J2.PP.INC      SSPTYPPP£PPBTAKGASR       135
PXXP.ABL.J3.PP.INC      SSPPXXXP£PIPNSPQVLSR      136
PXXP.ABL.J4.PP          SSPPTWTPEKP£GWGVVFSR      137
PXXP.ABL.L1.PP          SSAPTWSP£AL£NVAKYKSB      138
PXXP.ABL.L2/3.PP        SSIKGPRF£VP£VPIiNGVSB     139
PXXP.ABL.L4.PP          SSPPAWSP£HR£VAFGSTSR      140
CONSENSUS               PPxWxPPP0P                141


TABLE 4  PLCG SH3-BINDING PEPTIDES

                                                  SEQ. ID NO.
PXXP.PLCG.P1.PP         SSMKVHNF£LPELPSyETSE      142
PXXP.PLCG.P2.PP         SRVPPLVA£RP£STLNSLSR      143
PXXP.PLCG.P3.PP.INC     SSLYWQHG£DP£VGAPQLSE      144
PXXP.PLCG.P4.PP         SSHPIiNSWPGG£FRHNLSSB     145


TABLE 5  SRC SH3-BINDING PEPTIDES

                                                  SEQ. ID NO.
PXXP.SRC.A1.PP          SSRALRVRPLPPVPGTSLSR      146
PXXP.SRC.A2.PP          SSFRALPL£PTPDNPFAGSR      147
PXXP.SRC.A3.PP          SSDAPGSL£FR£LPPVPTSB      148
PXXP.SRC.A4.PP          §SISQRAL£PLPLMSDPASS      149
PXXP.SRC.B1.PP          SSPAYRPL£RLPDLSVIySE      150
PXXP.SRC.B2/3.PP        SSFINRRL£AL£PDNSLLSR      151
PXXP.SRC.B4.PP          SRLTGRPLPALPPPFSDFSR      152
PXXP.SRC.C1.PP          SEMKDRVL£PI£TVESAVSB      153
PXXP.SRC.C2.PP.INC      S£LYSAIA£DP£PRNSSSSR      154
PXXP.SRC.C3.PP          SSLASRPL£LL£NSAPGQSB      155
PXXP.SRC.D1.PP          S£LTSRPL£DI£VRPSKSSR      156
PXXP.SRC.D2.PP.INC      SSI*KWRAL£PLEETDTPYSE     157
PXXP.SRC.D3.PP          SSNTNRLP£PT£DGLDVRSR      158
PXXP.SRC.D4.PP          SSLOSRPLPIiPPQSSYPISR     159
CONSENSUS               RPLPPLP                   160



It should be apparent to one of ordinary skill
that many other embodiments of the present invention
can be contemplated beyond the preferred embodiments
described above but which other embodiments
nevertheless fall within the scope and spirit of the
present invention. Hence, the present invention should
not be construed to be limited to the preferred
embodiments described herein, which serve only to
illustrate the present invention, but only by the
claims that follow.

Also, numerous references are cited throughout the
specification. The complete disclosures of these
references are incorporated by reference herein.

SEQUENCE LISTING



(1) GENERAL INFORMATION: 



(i) APPLICANT: KAY, BRIAN K. 

SPARKS, ANDREW B. 
THORN, JUDITH M. 
QUILLIAM, LAWRENCE A.
DER, CHANNING J. 

(ii) TITLE OF INVENTION: Src SH3 BINDING PEPTIDES AND METHODS OF 
ISOLATING AND USING SAME 

(iii) NUMBER OF SEQUENCES: 106 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pennie & Edmonds 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: U.S.A. 

(F) ZIP: 10036-2711 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: S. Leslie Misrock
(B) REGISTRATION NUMBER: 18,872
(C) REFERENCE/DOCKET NUMBER: 1101-192

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (212) 790-9090
(B) TELEFAX: (212) 869-9741/8864

(C) TELEX: 66141 PENNIE 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 8

(D) OTHER INFORMATION: /note= "hydrophobic residue (such
as Pro or Leu)"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Xaa Pro Xaa Xaa Pro Pro Pro Xaa Xaa Pro
1               5                   10

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 9 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 1..3
(D) OTHER INFORMATION: /note= "Xaa = any residue other
than Cys"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 6
(D) OTHER INFORMATION: /note= "X = any residue other
than Cys"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 8..9
(D) OTHER INFORMATION: /note= "X = any residue other
than Cys"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Xaa Xaa Xaa Pro Pro Xaa Pro Xaa Xaa
1               5

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 9 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 2
(D) OTHER INFORMATION: /note= "X tends to be hydrophobic"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 8..9
(D) OTHER INFORMATION: /note= "X tends to be hydrophobic"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Arg Xaa Leu Pro Pro Arg Pro Xaa Xaa
1               5

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Arg Lys Leu Pro Pro Arg Pro Arg Arg 
1 5 



(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Val Pro Pro Pro Val Pro Pro Arg Arg Arg
1               5                   10

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 2
(D) OTHER INFORMATION: /note= "tends to be Pro"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 3
(D) OTHER INFORMATION: /note= "hydrophobic residue"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 5
(D) OTHER INFORMATION: /note= "tends to be Pro"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Xaa Xaa Xaa Pro Xaa Xaa Pro
1               5

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 9 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 2
(D) OTHER INFORMATION: /note= "X = any residue other
than Cys"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 8..9
(D) OTHER INFORMATION: /note= "X = any residue other
than Cys"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Arg Xaa Leu Pro Pro Arg Pro Xaa Xaa
1               5



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 9 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 9
(D) OTHER INFORMATION: /note= "hydrophobic residue"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Arg Xaa Leu Pro Pro Leu Pro Arg Phe
1               5



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Arg Pro Leu Pro Pro Leu Pro
1               5

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS: * 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
<D) TOPOtiOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-aite 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "any residue other than Cys" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 5, .6 

<D) OTHER INFORMATION: /note= "any hydrophobic residue" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 9

(D) OTHER INFORMATION: /note= "any hydrophilic residue
except Cys"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Arg Xaa Leu Pro Xaa Xaa Pro Xaa Xaa 
1 5 
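
The feature entries for SEQ ID NO: 10 amount to a position-by-position pattern: Arg, any residue other than Cys, Leu, Pro, two hydrophobic residues, Pro, an unrestricted residue, and a hydrophilic residue other than Cys. The following is a minimal sketch of how a candidate peptide could be checked against that pattern. The listing does not define which residues count as hydrophobic or hydrophilic, so the two residue classes below are assumptions, and the names `to_one_letter` and `matches_seq10` are hypothetical helpers introduced only for illustration.

```python
import re

# Three-letter to one-letter codes for the 20 standard amino acids.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

ALL_RESIDUES = "ACDEFGHIKLMNPQRSTVWY"
NOT_CYS = ALL_RESIDUES.replace("C", "")

# ASSUMPTION: these residue classes are not defined in the listing;
# they are one common grouping and can be adjusted.
HYDROPHOBIC = "AFILMPVWY"
HYDROPHILIC_NOT_CYS = "DEHKNQRST"

# Positions 1-9 of SEQ ID NO: 10; position 8 carries no feature note,
# so any standard residue is accepted there.
SEQ10_PATTERN = re.compile(
    f"R[{NOT_CYS}]LP[{HYDROPHOBIC}]{{2}}P[{ALL_RESIDUES}][{HYDROPHILIC_NOT_CYS}]"
)


def to_one_letter(three_letter_seq):
    """Convert a space-separated three-letter sequence to one-letter code."""
    return "".join(THREE_TO_ONE[res] for res in three_letter_seq.split())


def matches_seq10(peptide):
    """Return True if a one-letter peptide fits the SEQ ID NO: 10 pattern."""
    return SEQ10_PATTERN.fullmatch(peptide) is not None
```

Under these assumed residue classes, the nine-residue peptide of SEQ ID NO: 11 (Arg Pro Leu Pro Pro Leu Pro Thr Ser) fits the pattern, while replacing position 2 or position 9 with Cys does not.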



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Arg Pro Leu Pro Pro Leu Pro Thr Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: C-terminal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Xaa Xaa Xaa Xaa 
1 
(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: C-terminal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Xaa Xaa Xaa Xaa Xaa 
1 5 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: N-terminal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Xaa Xaa Xaa Xaa 
1 

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 1..3
(D) OTHER INFORMATION: /note= "any residue other than Cys"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 5
(D) OTHER INFORMATION: /note= "any residue other than Cys"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 8..9
(D) OTHER INFORMATION: /note= "any hydrophobic residue"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 11..13

(D) OTHER INFORMATION: /note= "any residue other than Cys"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Xaa Xaa Xaa Arg Xaa Leu Pro Xaa Xaa Pro Xaa Xaa Xaa 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 10 15

Xaa Xaa Xaa Xaa Pro Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
20 25 30 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Arg 
35 40 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GGCTCGAGNN NBNNBNNBNN BNNBNNBNNB NNBNNBNNBN NBNNBNNBNN BNNBNNBNNB 60

NNBCCAGGT 69
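
The degenerate positions in the oligonucleotides of this listing use IUPAC nucleotide codes: N is any base, B is C, G or T, V is A, C or G, K is G or T, and M is A or C. The following is a minimal sketch of how the codon set encoded by a degenerate triplet such as NNB or NNK can be enumerated. It assumes the standard genetic code, and `expand_codon` and `summarize` are hypothetical helper names introduced only for illustration.

```python
from itertools import product

# IUPAC nucleotide codes appearing in the degenerate oligonucleotides.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "N": "ACGT", "B": "CGT", "V": "ACG", "K": "GT", "M": "AC",
}

# ASSUMPTION: standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def expand_codon(degenerate):
    """List every concrete codon matched by a degenerate triplet."""
    return ["".join(c) for c in product(*(IUPAC[base] for base in degenerate))]


def summarize(degenerate):
    """Return (codon count, sorted stop codons, number of distinct amino acids)."""
    codons = expand_codon(degenerate)
    stops = sorted(c for c in codons if CODON_TABLE[c] == "*")
    residues = {CODON_TABLE[c] for c in codons} - {"*"}
    return len(codons), stops, len(residues)
```

With the standard code, `summarize("NNB")` reports 48 codons covering all 20 amino acids with TAG as the only stop codon, and `summarize("NNK")` reports 32 codons with the same coverage and the same single stop codon.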

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GGTCCANNVN NVNNVNNVNN VNNVNNVNNV NNVNNVNNVN NVNNVNNVNN VNNVNNVNNV 60

AGATCTGG 68
(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
TCGAGNNNBN NBNNBNNBNN BNNBNNBNNB NNBNNBNNBN NBNNBNNBNN BNNBNNBNNB 
CCAGGT 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GGTCCANNVN NVNNVNNVNN VNNVNNVNNV NNVNNVNNVN NVNNVNNVNN VNNVNNVNNV 
AGATC 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: N-terminal

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "Arg or Ser" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 10 15 

Xaa Xaa Xaa Xaa Pro Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
20 25 30

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Arg Pro Ser Arg Thr
35 40 45 
(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 49 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Ser His Ser Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
1 5 10 15

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
20 25 30 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Arg Pro Ser Arg 
35 40 45 

Thr 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "Ser or Thr" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 13 

(D) OTHER INFORMATION: /note= "Ser, Arg, Gly, Cys or Trp" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 15

(D) OTHER INFORMATION: /note= "Val, Ala, Asp, Glu or Gly"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa
1 5 10 15

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Arg 
20 25 

(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:
TTTTGTCGAC NNNBNNBNNB NNBNNBNNBN NBNNBNNBNN BNGCGGTG



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
CGCCACNVNN VNNVNNVNNV NNVNNVNNVN NVNNVNNVTG ATCATTTT 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:
CGCCACNVNN VNNVNNVNNV NNVNNVNNVN NVNNVNNVTG ATC 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TCGACNNNBN NBNNBNNBNN BNNBNNBNNB NNBNNBNGCG GTG 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: N-terminal

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 2

(D) OTHER INFORMATION: /note= "Ser or Thr"

(ix) FEATURE:

(A) NAME/KEY: Modified-site
(B) LOCATION: 13

(D) OTHER INFORMATION: /note= "Ser, Arg, Gly, Cys or Trp"

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 15 

(D) OTHER INFORMATION: /note= "Val, Ala, Asp, Glu or Gly" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa 
1 5 10 15

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Arg Pro Ser Arg Thr 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 5

(D) OTHER INFORMATION: /note= "Ser or Thr"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 16 

(D) OTHER INFORMATION: /note= "Ser, Arg, Gly, Cys or Trp" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 18

(D) OTHER INFORMATION: /note= "Val, Ala, Asp, Glu or Gly"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

Ser His Ser Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
1 5 10 15

Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Arg Pro Ser
20 25 30

Arg Thr 
(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
TGACGTCTCG AGTTGTNNKN NKNNKNNKNN KNNKNNKNNK TGTGGATCTA GAAGGATC 58 



(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CCTAGATCTT CCTAG 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
TCGAGTTGTN NKNNKNNKNN KNNKNNKNNK NNKTGTGGAT 40
(2) INFORMATION FOR SEQ ID NO: 34:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:
CAACANNMNN MNNMNNMNNM NNMNNMNNMA CACCTAGATC 40



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: N-terminal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Ser Ser Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Gly Ser Arg Pro 
1 5 10 15 

Ser Arg Thr 



(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Ser Ser Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Gly Ser Arg Ser 
1 5 10 15

Thr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Thr Arg 
20 25 



(2) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:
TCGAGTTGTN NKNNKNNKNN KNNKNNKNNK NNKTGTGGAT CTAGATCCAC AVNNVNNVNN 60
VNNVNNVNNV NNVNNACAAC 80

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Ser Ser Phe Asp Gln Gln Asp Trp Asp Tyr Ser Ile Ala Glu Lys Met
1 5 10 15

His Pro Ile Arg Pro Gly Phe Arg Glu Leu Pro Pro Leu Pro Pro Ser
20 25 30 

Arg Ala Ser Phe Gly Gly Gly Ala Ser Arg Pro Ser Arg 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Ser Thr Asn Val Trp Val Thr Gly Ser Val Ile Ala Arg Gly Ala Gln
1 5 10 15

Ser Arg Pro Leu Pro Ile Pro Pro Glu Thr Arg Pro Ser Arg
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

Ser Thr Ala Pro Trp Gly Leu Arg Val Ala His Glu Gly Gly Val Leu
1 5 10 15

Lys Arg Pro Leu Pro Ile Pro Pro Val Thr Arg Pro Ser Arg
20 25 30 
(2) INFORMATION FOR SEQ ID NO: 41:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 45 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Ser Ser Ser Gly Tyr Val Val Pro Lys Arg Leu Gly Asp Met Arg Glu 
1 5 10 15

Tyr Asn Ala His Pro Gly Leu His Val Pro Pro Asn Ser Pro Leu Pro 
20 25 30 

Pro Leu Pro Thr His Leu Gln Ser Ser Arg Pro Ser Arg
35 40 45 



(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:

Ser Ser Arg Gly Glu Gly Asn Asn Ile Ile Ser Ser Arg Pro Phe Leu
1 5 10 15

Ser Asn Ser Asp Pro Gly Val Ser Asn Lys Leu Thr Gly Arg Gly Pro 
20 25 30 

Leu Pro Pro Leu Pro Asn Asp Ser Arg Pro Ser Arg 
35 40 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

Ser Thr Ala Val Ser Phe Arg Phe Met Pro Gly Gly Gly Gly Ala Phe 
1 5 10 15 

Tyr Ser Thr Arg Pro Val Pro Pro Ile Thr Arg Pro Ser Arg Thr
20 25 30
(2) INFORMATION FOR SEQ ID NO: 44:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Ser Thr Ala His Ser Leu Trp Asp Trp Gly Thr Phe Ser Gly Val Ser 
1 5 10 15

His Lys Ser Arg Leu Pro Pro Leu Pro Thr Arg Pro Ser Arg Thr 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Pro Gly Tyr Ala Arg Ile Val Ser Tyr Arg Phe Arg Ala Leu Pro Ser
1 5 10 15

Pro Pro Ser Ala Ser Arg Pro Ser Arg 
20 25 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

Ser Thr Asn Asp Val Asp Trp Met His Met Trp Asn Ser Gly Gly Pro 
1 5 10 15

His Arg Arg Leu Pro Pro Thr Pro Ala Thr Arg Pro Ser Arg 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47:

Ser Ser Asp Asn Trp Ala Arg Arg Val His Ala Ser Glu Leu Ile Tyr
1 5 10 15

Thr Asp Leu Ser Pro Gly Ile Leu Leu Ala Gln Arg Gln Leu Pro Pro
20 25 30 

Thr Pro Gly Arg Asp Pro Ser His Ser Arg Pro Ser Arg 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids
(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Ser Ser Glu Ser Pro Leu Met Tyr Asn Arg Val Gly Ala Leu Gln Ser
1 5 10 15

Leu Thr Ser Val Pro Gly Ser Met Met His Phe Ala Leu Gln Arg Arg
20 25 30 

Leu Pro Arg Thr Pro Pro Pro Ala Ser Arg Pro Ser Arg 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Ser Thr Arg Trp Ser His Ser Trp Pro Gly Tyr Val Gly Gly Ala Asn 
1 5 10 15

Pro Ser Pro Ala Thr Arg Pro Leu Pro Thr Arg Pro Ser Arg Thr Val 
20 25 30 

Glu Ser Cys 
35 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50:

Ser Arg Tyr Asn Asp Leu Gly Thr Arg Pro Val Ser Glu Val Ile Lys
1 5 10 15

Tyr Asp Tyr Phe Pro Gly Tyr Ser Gln His Val Ile Thr Pro Asp Gly
20 25 30 

Ser Tyr Ser Thr Arg Pro Leu Pro Ser Arg Pro Ser Arg Thr Val Glu 
35 40 45 

Ser Cys 
50 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Pro Gly Arg Leu Leu Leu Pro Ser Glu Pro Arg Thr Phe Tyr Asn Tyr 
1 5 10 15 

Gly His Asp Ser Arg Pro Ser Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

Ser Thr Met Tyr Gly Val Ser Trp Leu Ser Ser Gly Ser Gly Gly Ile
1 5 10 15

Leu Ala Pro Pro Val Pro Pro Arg Asn Thr Arg Pro Ser Arg 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

Ser Ser Cys Thr Glu Lys Thr Val Ser Gly Trp Cys Gly Ser Arg Ser
1 5 10 15

Thr Arg Pro Leu Pro Ile Leu Pro Arg Thr Thr Arg Pro Ser Arg
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Ser Ser Cys Met Leu Pro Thr Asp Gly Trp Gln Cys Gly Ser Arg Ser
1 5 10 15

Thr Pro Arg Pro Leu Pro Met Leu Pro Thr Thr Arg Pro Ser Arg 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 55:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Ser Ser Cys Asp Gly Thr Gln Phe Arg Leu Asn Cys Gly Ser Arg Ser
1 5 10 15

Thr Asn Arg Pro Leu Pro Met Ile Pro Thr Thr Arg Pro Ser Arg
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56:

Ser Ser Cys Met Gln Gly Gln Ala Gly Leu Lys Cys Gly Ser Arg Ser
1 5 10 15

Thr Arg Pro Leu Pro Ser Leu Pro Ile Thr Thr Arg Pro Ser Arg
20 25 30 
(2) INFORMATION FOR SEQ ID NO: 57:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Ser Ser Cys Tyr Arg Glu Lys Asp Thr Trp Gly Cys Gly Ser Arg Ser 
1 5 10 15

Thr Ser Arg Pro Leu Pro Ser Leu Pro Thr Thr Arg Pro Ser Arg 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:

Ser Ser Cys Leu Phe Glu Gln Gly Ala Gly Thr Cys Gly Ser Arg Ser
1 5 10 15

Thr Arg Ser Leu Pro Pro Leu Pro Pro Thr Thr Arg Pro Ser Ser Arg 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 59:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Ser Ser Cys Asp Thr Gly Arg Ile Ala Pro Gly Cys Gly Ser Arg Ser
1 5 10 15

Thr Pro Arg Pro Leu Pro Leu Ile Pro Thr Thr Pro Arg Ser Thr Asn
20 25 30

Leu Asn Leu Thr Ser Thr Thr Thr Arg Pro Ser Arg
35 40 
(2) INFORMATION FOR SEQ ID NO: 60:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Ser Ser Cys Gly Leu Asp Asn Ala Ala Lys Thr Cys Gly Ser Arg Ser 
1 5 10 15

Thr Arg Pro Leu Pro Pro Thr Pro Leu Thr Thr Arg Pro Ser Arg 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Ser Ser Cys Ser Arg Ala His Glu Thr Glu Met Cys Gly Ser Arg Ser 
1 5 10 15 

Thr Arg Pro Gln Pro Pro Pro Pro Ile Thr Thr Arg Pro Ser Arg
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Arg Ser Thr Pro Arg Pro Leu Pro Met Leu Pro Thr Thr Arg 
1 5 10



(2) INFORMATION FOR SEQ ID NO:63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63:

Gly Ile Leu Ala Pro Pro Val Pro Pro Arg Asn Thr Arg
1 5 10



(2) INFORMATION FOR SEQ ID NO: 64:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Val Leu Lys Arg Pro Leu Pro Ile Pro Pro Val Thr Arg
1 5 10



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

Gly Pro His Arg Arg Leu Pro Pro Thr Pro Ala Thr Arg 
1 5 10 



(2) INFORMATION FOR SEQ ID NO:66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Ala Asn Pro Ser Pro Ala Thr Arg Pro Leu Pro Thr Arg 
1 5 10



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67:

Arg Ser Thr Pro Arg Pro Leu Pro Pro Leu Pro Thr Thr Arg
1 5 10



(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Gly Thr Val Glu Pro Val Pro Pro Pro Val Pro Pro Arg Arg Arg Pro 
1 5 10 15

Glu Ser Ala 



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69:

Lys Ile Ser Pro Pro Thr Pro Lys Pro Arg Pro Pro Arg Pro Leu Pro
1 5 10 15

Val 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Leu Ala Pro Pro Lys Pro Pro Leu Pro Glu Gly Glu Val 
1 5 10

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 2

(D) OTHER INFORMATION: /note= "any residue other than Cys"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 5..6

(D) OTHER INFORMATION: /note= "any hydrophobic residue"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:

Arg Xaa Leu Pro Xaa Xaa Pro 
1 5 



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Pro Gly Phe Arg Glu Leu Pro Pro Leu Pro Pro Ser Arg 
1 5 10



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Ala Gln Ser Arg Pro Leu Pro Ile Pro Pro Glu Thr Arg
1 5 10



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: 

Pro Pro Asn Ser Pro Leu Pro Pro Leu Pro Thr His Leu 
1 5 10
(2) INFORMATION FOR SEQ ID NO: 75:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75:

Thr Gly Arg Gly Pro Leu Pro Pro Leu Pro Asn Asp Ser
1 5 10



(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Tyr Ser Thr Arg Pro Val Pro Pro Ile Thr Arg Pro Ser
1 5 10



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Ser His Lys Ser Arg Leu Pro Pro Leu Pro Thr Arg Pro 
1 5 10



(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Tyr Arg Phe Arg Ala Leu Pro Ser Pro Pro Ser Ala Ser 
1 5 10
(2) INFORMATION FOR SEQ ID NO: 79:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79:

Leu Ala Gln Arg Gln Leu Pro Pro Thr Pro Gly Arg Asp
1 5 10



(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Ala Leu Gln Arg Arg Leu Pro Arg Thr Pro Pro Pro Ala
1 5 10

(2) INFORMATION FOR SEQ ID NO: 81:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Pro Ala Thr Arg Pro Leu Pro Thr Arg Pro Ser Arg Thr
1 5 10

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Tyr Ser Thr Arg Pro Leu Pro Ser Arg Pro Ser Arg Thr 
1 5 10 
(2) INFORMATION FOR SEQ ID NO: 83:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

Xaa Pro Gly Arg Ile Leu Leu Leu Pro Ser Glu Pro Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84:

Ser Gly Gly Ile Leu Ala Pro Pro Val Pro Pro Arg Asn
1 5 10

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

Arg Ser Thr Arg Pro Leu Pro Ile Leu Pro Arg Thr Thr
1 5 10



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Ser Thr Pro Arg Pro Leu Pro Met Leu Pro Thr Thr Arg 
1 5 10 
(2) INFORMATION FOR SEQ ID NO: 87:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87:

Ser Thr Asn Arg Pro Leu Pro Met Ile Pro Thr Thr Arg
1 5 10



(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Arg Ser Thr Arg Pro Leu Pro Ser Leu Pro Ile Thr Thr
1 5 10



(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

Ser Thr Ser Arg Pro Leu Pro Ser Leu Pro Thr Thr Arg 
1 5 10



(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Arg Ser Thr Arg Pro Leu Pro Ser Leu Pro Ile Thr Thr
1 5 10
(2) INFORMATION FOR SEQ ID NO: 91:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Arg Ser Thr Arg Gln Leu Pro Ile Pro Pro Thr Thr Thr
1 5 10



(2) INFORMATION FOR SEQ ID NO: 92:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Ser Thr Pro Arg Pro Leu Pro Leu Ile Pro Thr Thr Pro
1 5 10



(2) INFORMATION FOR SEQ ID NO:93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

Arg Ser Thr Arg Pro Leu Pro Pro Thr Pro Leu Thr Thr 
1 5 10



(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94:

Arg Ser Thr Arg Pro Gln Pro Pro Pro Pro Ile Thr Thr
1 5 10
(2) INFORMATION FOR SEQ ID NO: 95:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

Arg Pro Leu Pro Met Leu Pro 
1 5 



(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 5

(D) OTHER INFORMATION: /note= "Pro or Met" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

Arg Pro Leu Pro Xaa Leu Pro 
1 5 



(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

Arg Ser Thr Pro 
1 



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98:

Arg Ser Thr Pro Ala Pro Pro Val Pro Pro Arg Thr Thr Arg
1 5 10



(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
AGCGTAACGA TCTCCCG 



(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
TTCACCTCGA AAGCAAGCTG 



(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
CCTCATAGTT AGCGTAACG 



(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102:
AGCGTAACGA TCTAAA 



(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103:

Lys Ser Gly Ser Gly 
1 5 



(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
Gly Ser Gly Ser 



(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Asn
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:

Ser Ser Cys Asp His Thr Leu Gly Leu Gly Trp Cys Gly Ser Arg Ser
1 5 10 15

Thr Arg Gln Leu Pro Ile Pro Pro Thr Thr Thr Arg Pro Ser Arg
20 25 30
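
The listing above gives each peptide in three-letter code, while the claims that follow use one-letter strings. As a reading aid only (it is not part of the filing), the following minimal Python sketch converts a three-letter listing into its one-letter form. The mapping is the standard amino acid code table; the function name `to_one_letter` is a hypothetical helper introduced here for illustration.

```python
# Standard three-letter to one-letter amino acid codes.
# "Xaa" (unspecified residue) is mapped to "X".
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",
}


def to_one_letter(listing: str) -> str:
    """Convert a whitespace-separated three-letter listing to one-letter code.

    Raises KeyError on an unrecognized token, which is a cheap way to catch
    transcription errors such as "Gin" for "Gln" or "lie" for "Ile".
    """
    return "".join(THREE_TO_ONE[token] for token in listing.split())


# SEQ ID NO: 91 as given in the listing.
SEQ_ID_91 = "Arg Ser Thr Arg Gln Leu Pro Ile Pro Pro Thr Thr Thr"
print(to_one_letter(SEQ_ID_91))
```

Because an unknown token raises an error rather than being skipped, the same helper doubles as a sanity check when a listing has been retyped or scanned.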
WHAT IS CLAIMED IS:

1. A peptide having at least nine and up to forty-five
amino acid residues, including an amino acid sequence of the
formula, R-2-L-P-5-6-P-8-9 (SEQ ID NO:10), positioned anywhere
along the peptide, in which each number represents an amino acid
residue, such that 2 represents any amino acid residue except
cysteine, 5 and 6 each represents a hydrophobic amino acid
residue, 8 represents any amino acid residue except cysteine, and
9 represents a hydrophilic amino acid residue except cysteine,
each letter being the standard one-letter symbol for the
corresponding amino acid, said peptide exhibiting a binding
affinity for the SH3 domain of Src, provided that said peptide is
not R-P-L-P-P-L-P-T-S (SEQ ID NO:11).

2. The peptide of claim 1 in which 2 is a P, R, A, L, Q, E or S.

3. The peptide of claim 1 in which 5 is a P, M, I or L.

4. The peptide of claim 1 in which 6 is a P, L, I or V. 

5. The peptide of claim 1 in which 8 is a T, R, P, I, N, 
E, V, S, A, G or L. 

6. The peptide of claim 1 in which 9 is a T, R, S, H or D. 

7. The peptide of claim 1 which further comprises a
C-terminal-flanking amino acid sequence of the formula 10, 10-11,
10-11-12, 10-11-12-13 (SEQ ID NO:12) or 10-11-12-13-14 (SEQ ID
NO:13), in which each number represents any amino acid residue
except cysteine, such that 10 is bound to 9 by a peptide bond.

8. The peptide of claim 7 in which 10 is T, R, L, S, D, P, 
A or N. 

9. The peptide of claim 7 in which 11 is R, P, A, Q, S or T.

10. The peptide of claim 7 in which 12 is P, S, R or T. 

11. The peptide of claim 7 in which 13 is P, S, R, F, H or T.

12. The peptide of claim 7 in which 14 is S, R, G or T. 

13. The peptide of claim 1 which further comprises an
N-terminal-flanking amino acid sequence of the formula 1', 2'-1',
3'-2'-1' or 4'-3'-2'-1' (SEQ ID NO:14) in which each number
represents any amino acid residue except cysteine, such that 1'
is bound to R by a peptide bond.

14. The peptide of claim 13 in which 1' is T, P, S, N, F, 
W, K, H, Q or G. 

15. The peptide of claim 13 in which 2' is S, T, G, P, R, 
Q, L, A or H. 

16. The peptide of claim 13 in which 3' is R, S, P, G, A,
V, Y or L.

17. The peptide of claim 13 in which 4' is R, S, V, T, G, L or F.

18. A peptide having at least thirteen and up to forty-five
amino acid residues, including an amino acid sequence of the
formula, 3'-2'-1'-R-2-L-P-5-6-P-8-9-10 (SEQ ID NO:15), positioned
anywhere along the peptide, in which each number represents an
amino acid residue, such that 3', 2', 1', 2, 8, and 10 each
represents any amino acid residue except cysteine, 5 and 6 each
represents a hydrophobic amino acid residue, and 9 represents a
hydrophilic amino acid residue except cysteine, each letter being
the standard one-letter symbol for the corresponding amino acid,
said peptide exhibiting a binding affinity for the SH3 domain of
Src.

19. The peptide of claim 18 in which 5 is a P or M. 

20. The peptide of claim 18 in which 1' is T, P, S or N. 

21. The peptide of claim 18 in which 2' is S or T. 

22. The peptide of claim 18 in which 3' is R or S.

23. The peptide of claim 18 in which 10 is T or R. 

24. The peptide of claim 1 the binding affinity of which is
at least three-fold greater than that exhibited by the peptide
RPLPPLP for the SH3 domain of Src.

25. The peptide of claim 18 the binding affinity of which
is at least three-fold greater than that exhibited by the peptide
RPLPPLP (SEQ ID NO:9) for the SH3 domain of Src.

26. The peptide of claim 1 the binding affinity of which is
at least four-fold greater than that exhibited by the peptide
RPLPPLP (SEQ ID NO:9) for the SH3 domain of Src.

27. The peptide of claim 18 the binding affinity of which
is at least four-fold greater than that exhibited by the peptide
RPLPPLP (SEQ ID NO:9) for the SH3 domain of Src.

28. The peptide of claim 1 which further exhibits a general
binding affinity for the SH3 domain of Abl, Grb2, PLC-δ, PLC-γ,
Ras GAP, Nck, p85 PI-3' Kinase, and proteins related thereto.

29. The peptide of claim 1 which exhibits a selective
binding affinity for the SH3 domain of Src and Src-related
proteins.

30. The peptide of claim 18 which further exhibits a
general binding affinity for the SH3 domain of Abl, Grb2, PLC-δ,
PLC-γ, Ras GAP, Nck, p85 PI-3' Kinase, and proteins related
thereto.

31. The peptide of claim 18 which exhibits a selective 
binding affinity for the SH3 domain of Src and Src-related 
proteins. 

32. A peptide having the amino acid sequence RSTPRPLPMLPTTR
(SEQ ID NO:62).

33. A peptide having the amino acid sequence RSTPRPLPPLPTTR
(SEQ ID NO:67).

34. A peptide having the amino acid sequence GILAPPVPPRNTR
(SEQ ID NO:63).

35. A peptide having the amino acid sequence VLKRPLPIPPVTR
(SEQ ID NO:64).

36. A peptide having the amino acid sequence GPHRRLPPTPATR
(SEQ ID NO:65).

37. A peptide having the amino acid sequence ANPSPATRPLPTR
(SEQ ID NO:66).

38. A peptide having an amino acid sequence selected from
the group consisting of RSTRPLPILPRTT, STPRPLPMLPTTR,
STNRPLPMIPTTR, RSTRPLPSLPITT, STSRPLPSLPTTR, RSTRSLPPLPPTT,
RSTRQLPIPPTTT, STPRPLPLIPTTP, RSTRPLPPTPLTT, and RSTRPQPPPPITT
(SEQ ID NOS:85-94).

39. A peptide having the amino acid sequence selected from
the group consisting of VLKRPLPIPPVTR (SEQ ID NO:64),
YSTRPVPPITRPS (SEQ ID NO:76), SHKSRLPPLPTRP (SEQ ID NO:77),
GPHRRLPPTPATR (SEQ ID NO:65), PATRPLPTRPSRT (SEQ ID NO:81), and
SGGILAPPVPPRN (SEQ ID NO:84).

40. A peptide having the amino acid sequence selected from
the group consisting of PPNSPLPPLPTHL (SEQ ID NO:72),
TGRGPLPPLPNDS (SEQ ID NO:74), YRFRALPSPPSAS, LAQRQLPPTPGRD,
ALQRRLPRTPPPA (SEQ ID NOS:78-80), YSTRPLPSRPSRT, and
XPGRILLLPSEPR (SEQ ID NOS:82-83).

41. A construct comprising a nucleic acid encoding a 
peptide of claim 1 or its complement. 

42. The construct of claim 41 which is a DNA
polynucleotide.

43. The construct of claim 41 which is a RNA
polynucleotide.

44. A construct comprising a nucleic acid encoding a
peptide of claim 18 or its complement.

45. The construct of claim 44 which is a DNA
polynucleotide.

46. The construct of claim 44 which is a RNA
polynucleotide.

47. The construct of claim 41 which is a transforming 
vector. 

48. The construct of claim 44 which is a transforming 
vector. 

49. A host cell transformed with the vector of claim 47. 

50. A host cell transformed with the vector of claim 48. 

51. A conjugate comprising a peptide of claim 1 and a
second molecule.

52. The conjugate of claim 51 in which said second molecule
is selected from the group consisting of an amino acid, a
peptide, a protein, a nucleic acid, a nucleoside, a glycosidic
residue, a label, a drug or a small molecule.

53. A diagnostic kit for the detection of SH3 domains
comprising an SH3 domain-binding peptide and a detectable label
conjugated to said peptide directly, indirectly or by
complexation, said peptide comprising: (i) a core sequence motif
of the formula RXLPφφP (SEQ ID NO:71), wherein X represents any
amino acid except cysteine and φ represents a hydrophobic amino
acid residue, each letter representing the standard one-letter
designation for the corresponding amino acid residue; and (ii)
two or more additional amino acid residues flanking said core
sequence at its C-terminal end, N-terminal end or both.

54. A drug delivery system comprising an SH3 domain-binding
peptide and a drug conjugated to said peptide directly,
indirectly or by complexation, said peptide comprising: (i) a
core sequence motif of the formula RXLPφφP (SEQ ID NO:71),
wherein X represents any amino acid except cysteine and φ
represents a hydrophobic amino acid residue, each letter
representing the standard one-letter designation for the
corresponding amino acid residue; and (ii) two or more additional
amino acid residues flanking said core sequence at its C-terminal
end, N-terminal end or both.

55. The drug delivery system of claim 54 which may be
administered parenterally, orally, enterally, topically or by
inhalation.

56. The drug delivery system of claim 54 which may be
administered intranasally, ophthalmically or intravaginally.

57. The drug delivery system of claim 54 which is in the 
form of a solid, gel, liquid or aerosol. 

58. A method of modulating the activity of Src or Src- 
related proteins comprising administering a composition 
comprising an effective amount of a peptide of claim 1 and a 
carrier. 

59. The method of claim 58 which inhibits the activity of
Src or Src-related proteins.

60. The method of claim 58 which activates Src or Src-
related proteins.

61. A method of identifying a peptide having a region that
binds to an SH3 domain comprising:

(a) providing an immobilized target protein comprising
an SH3 domain;

(b) incubating said immobilized target protein with an
aliquot taken from a random peptide library;

(c) washing unbound peptide from said immobilized
target protein;

(d) recovering the peptide bound to said immobilized
target protein; and

(e) determining the primary sequence of the SH3
domain-binding peptide.

62. The method of claim 61 in which said library is a 
displayed random peptide library. 

63. The method of claim 62 in which said library is a
phage-displayed random peptide library.

64. The method of claim 62 in which said library is a
phagemid-displayed random peptide library.

65. The method of claim 61 in which step (c) includes
washing unbound phage from said immobilized target protein; step
(d) includes recovering the phage bound to said immobilized
target protein; and step (e) includes determining the relevant
nucleotide sequence of said binding phage nucleic acid, from
which the primary sequence corresponding to the SH3 domain-
binding peptide is deduced.

66. A method of identifying a peptide having a region that
binds to an SH3 domain comprising:

(a) providing an immobilized target protein comprising
an SH3 domain;

(b) incubating said immobilized target protein with an
aliquot taken from a phage-displayed random peptide library,
which library includes peptides having a random sequence of
amino acid residues;

(c) washing unbound phage from said immobilized target
protein;

(d) recovering the phage bound to said immobilized
target protein; and

(e) determining the relevant nucleotide sequence of
said binding phage nucleic acid and deducing the primary sequence
corresponding to the SH3 domain-binding peptide.

67. The method of claim 66 which further comprises 
amplifying the titer of the recovered phage. 

68. The method of claim 66 which further comprises 
repeating the steps of incubation, washing and recovery to 
provide SH3 domain-binding peptide-enriched phage. 

69. A pharmaceutical composition comprising an SH3 domain-
binding peptide and a pharmaceutically acceptable carrier, said
peptide comprising: (i) a 9-mer sequence motif of the formula
RXLPφφPXψ (SEQ ID NO:10), wherein X represents any amino acid
except cysteine, φ represents a hydrophobic amino acid residue,
and wherein ψ is a hydrophilic amino acid residue except
cysteine, each letter representing the standard one-letter
designation for the corresponding amino acid residue; and,
optionally, (ii) additional amino acid residues flanking said 9-
mer sequence at its C-terminal end, N-terminal end or both, up to
a total of 45 amino acid residues, including said 9-mer sequence.

70. The composition of claim 69 in which at least one 
additional amino acid flanks said 9-mer sequence. 

71. The composition of claim 69 in which at least two 
additional amino acids flank said 9-mer sequence. 

72. The composition of claim 69 in which at least three
additional amino acids flank said 9-mer sequence.

73. A method of disrupting protein tyrosine kinase-mediated
signal transduction pathways comprising administering an
effective amount of a peptide of claim 1.

74. A method of regulating the processing, trafficking or 
translation of RNA by administering an effective amount of a 
peptide of claim 1.

Fig. 2

Fig. 3

Fig. 9

Src SH3 BINDING PEPTIDES AND
METHODS OF ISOLATING AND USING SAME

1. Field of the Invention

The present invention relates to SH3 binding peptides having a
broad range of binding specificities. That is, certain members
of the SH3 binding peptides disclosed bind with approximately
the same facility to SH3 domains derived from different SH3
domain-containing proteins. Other members, in contrast, bind
with a much greater degree of affinity to specific SH3 domains.
The SH3 binding peptides are obtained from phage-displayed
random peptide libraries. Methods are described of obtaining
the phage clones that bind to the SH3 domain targets and of
determining their relevant nucleotide sequences and the
consequent primary amino acid sequences of the binding peptides.
The resulting SH3 binding peptides are useful in a number of
ways, including, but not limited to, providing a method of
modulating signal transduction pathways at the cellular level,
of modulating oncogenic protein activity or of providing lead
compounds for development of drugs with the ability to modulate
broad classes, as well as specific classes, of proteins involved
in signal transduction.

2. Background of the Invention

2.1. Src and the SH3 Domain

Among a number of proteins involved in eukaryotic cell
signaling, there is a common sequence motif called the SH3
domain. It is 50-70 amino acids in length, moderately conserved
in primary structure, and can be present from one to several
times in a large number of proteins involved in signal
transduction and in cytoskeletal proteins.

The protein pp60c-src represents a family of at least
nine non-receptor protein tyrosine kinases (NR-PTKs).
Members of this family share an overall structural
organization comprising a series of catalytic and non-
catalytic domains. In Src, a 14-amino-acid myristylation
signal resides at the extreme amino-terminus, and is followed
by a unique region that is not highly conserved among family
members. Following this region are two highly conserved 60-
and 100-amino-acid regions, the Src homology (SH) domains 3
and 2, respectively. SH2 and SH3 domains have been shown to
play an important role in mediating protein-protein
interactions in a variety of signaling pathways. Koch, C.A.,
et al., in Science (1991) 252:668-74. The carboxy-terminal
half of Src contains the PTK catalytic domain, as well as a
negative regulatory tyrosine (Y527) near the carboxy
terminus. Phosphorylation of this residue (e.g., by Csk)
results in the inhibition of PTK activity. Cooper, J.A., et
al., in Science (1986) 231:1431-1434. Mutation of Y527->F
generates forms of Src with increased PTK and oncogenic
activity. Cartwright, C.A., et al., in Cell (1987) 49:83-91;
Kmiecik, T.E., et al., in Cell (1987) 49:65-73; and Piwnica-
Worms, H., et al., in Cell (1987) 75-82.

The fact that some mutations which result in increased
Src PTK and transforming activity map to the Src SH2 (Seidel-
Dugan, C., et al., in Mol. Cell. Biol. (1992) 12:1835-45; and
Hirai, H. and Varmus, H.E. in Mol. Cell. Biol. (1990)
10:1307-1318) and SH3 domains (Seidel-Dugan, C., et al.,
supra; Hirai, H. and Varmus, H.E., supra; Superti-Furga, G.,
et al., in Embo. J. (1993) 12:2625-34; and Potts, W.M., et
al., in Oncogene Res. (1988) 3:343-355) suggests a negative
regulatory role for these domains. That phosphotyrosine
residues within specific sequence contexts represent high
affinity ligands for SH2 domains suggests a model in which
the SH2 domain participates in Y527-mediated inhibition of
PTK activity by binding phosphorylated Y527, thereby locking
the kinase domain in an inactive configuration. Matsuda, M.,
Mayer, B.J., et al., in Science (1990) 248:1537-1539. This
model is supported by the observation that phosphopeptides
corresponding to the carboxy-terminal tail of Src bind
active, but not inactive, variants of Src. Roussel, R.R., et
al., in Proc. Natl. Acad. Sci. USA (1991) 88:10696-700; and
Liu, X., et al., in Oncogene (1993) 8:1119-1126.

The mechanism of SH3-mediated inhibition of Src PTK
activity remains unclear. There is evidence that pY527-
mediated inhibition of Src PTK activity involves the SH3
domain as well as the SH2 domain. Okada, M., Howell, et al.,
in J. Biol. Chem. (1993) 268:18070-5; Murphy, S.M., et al.,
in Mol. Cell. Biol. (1993) 13:5290-300; and Superti-Furga,
G., et al., supra. Although these effects are thought to be
a consequence of SH3-mediated protein-protein interactions,
precisely how the Src SH3 domain exerts its negative
regulatory effect is unclear. Identification of high
affinity ligands for the Src SH3 domain could help resolve
these issues.

2.2. Protein Tyrosine Kinases and The Immune Response

Src-related tyrosine kinases are expressed in a
variety of cell types including those of the immune system
(lymphocytes, T cells, B cells, and natural killer cells) and
the central nervous system (neural cells, neurons,
oligodendrocytes, parts of the cerebellum, and the like).
Umemori, H. et al., in Brain Res. Mol. Brain Res. (1992) Dec.
16(3-4):303-310. Their presence in these cells and tissues
and their interaction with specific cell surface receptors
and immunomodulatory proteins (such as T cell antigen
receptor, CD14, CD2, CD4, CD40 or CD45) suggest that these
kinases serve an important role in the signalling pathways of
not only the central nervous system but of the immune system,
as well. See, e.g., Ren, C.L. et al., in J. Exp. Med. (1994)
179(2):673-680 (signal transduction via CD40 involves
activation of Lyn kinase); Donovan, J.A. and Koretzky, G.A.,
in J. Am. Soc. Nephrol. (1993) 4(4):976-985 (CD45, the immune
response, and regulation of Lck and Fyn kinases); and Carmo,
A.M. et al., Eur. J. Immunol. (1993) 23(9):2196-2201
(physical association of the cytoplasmic domain of CD2 with
p56lck and p59fyn).

For instance, mice with disruptions in their Src-like
genes, Hck and Fgr, possess macrophages with impaired
phagocytic activity or exhibit a novel immunodeficiency
characterized by an increased susceptibility to infection
with Listeria monocytogenes. Lowell, C.A. et al., in Genes
Dev. (1994) 8(4):387-398. Also, it has been shown that
bacterial lipopolysaccharide (LPS) activates CD14-associated
p56lyn, p68hck, and p59c-fgr, while inducing the production
of lymphokines, such as TNF-alpha, IL-1, IL-6, and IL-8.
Inhibition of the protein tyrosine kinases blocks production
of TNF-alpha and IL-1.

2.3. SH3 Binding Peptides

As mentioned above, it has long been suspected that
SH3 domains are sites of protein-protein interaction, but it
has been unclear what SH3 domains actually bind. Efforts to
identify ligands for SH3 domains have led to the
characterization of a number of SH3-binding proteins,
including 3BP1 and 2 (Ren, R., Mayer, et al., in Science
(1993) 259:1157-61), SOS (Olivier, J.P., et al., in Cell
(1993) 73:179-91; and Rozakis-Adcock, M., et al., in Nature
(1993) 363:83-5), p85 PI-3' Kinase (Xingquan, et al., in
Mol. Cell. Biol. (1993) 13:5225-5232), dynamin (Gout, I., et
al., in Cell (1993) 75:25-36), AFAP-110 (Flynn, D.C., et al.,
in Mol. Cell. Biol. (1993) 13:7892-7900), and CD42 (Barfod,
E.T., et al., in J. Biol. Chem. (1993) 268:26059-26062).
These proteins tend to possess short, proline-rich stretches
of amino acids, some of which have been directly implicated
in SH3 binding. A variety of consensus sequences have been
proposed, although the similarity among proline-rich regions
of different SH3-binding proteins tends to be fairly low.
Also, attempts to build consensus sequences are likely
complicated by the incorporation of data from proteins that
bind different SH3 domains.

Thus, Cicchetti, P., et al., in Science (1992) 257:803-
806, published their work relating to the isolation and
sequencing of two naturally-occurring proteins that could be
bound in vitro by the SH3 domain of the abl oncogene product.
These workers found that SH3 domains bind short, proline-rich
regions of such proteins. Subsequently, this same group
disclosed further results (Ren, R. et al., supra) in which
the SH3 binding sites of the SH3 binding proteins were
localized to "a nine- or ten-amino acid stretch rich in
proline residues." A consensus sequence incorporating the
features of the SH3 binding sites of four SH3 binding
proteins was proposed: XPXXPPPΨXP (SEQ ID NO:1), wherein X
indicates a position in the amino acid sequence which is not
conserved among the four SH3 binding proteins, P represents
proline, and Ψ indicates a hydrophobic amino acid residue,
such as P or L.

The screening of complex random peptide libraries has
been used to identify peptide epitopes for monoclonal (Scott,
J.K. and Smith, G.P. in Science (1990) 249:386-390) and
polyclonal (Kay, B.K., et al., in Gene (1993) 128:59-65)
antibodies, as well as peptide ligands for a variety of
proteins, including streptavidin (Devlin, J.J., et al., in
Science (1990) 249:404-406; and Lam, K., et al., in Nature
(1991) 354:82-84), the endoplasmic reticulum chaperone BiP
(Blond-Elguindi, S., et al., in Cell (1993) 75:717-728), and
CaM (Dedman, J.R., et al., in J. Biol. Chem. (1993)
268:23025-23030).

Recently, Chen, J.K. et al., in J. Am. Chem. Soc. (1993)
115:12591-12592, described ligands for the SH3 domain of
phosphatidylinositol 3-kinase (PI-3' Kinase) which were
isolated from a biased combinatorial library. A "biased"
library is to be distinguished from a "random" library in
that the amino acid residues at certain positions of the
synthetic peptide are fixed, i.e., not allowed to vary in a
random fashion. Indeed, as stated by these research workers,
screening of a "random" combinatorial library failed to yield
suitable ligands for a PI-3' Kinase SH3 domain probe. The
binding affinities of these unsuitable ligands were described
as weak, >100 µM, based on dissociation constants measured by
the Biosensor System (BIAcore).

More recently, Yu, et al. (Yu, H., et al., in Cell
(1994) 76:933-945) used a "biased" synthetic peptide library
of the form XXXPPXPXX (SEQ ID NO:2), wherein X represents any
amino acid other than cysteine, to identify a series of
peptides which bind the Src and PI-3' Kinase SH3 domains.
The bias was accomplished by fixing the proline residues at
the specific amino acid positions indicated for the "random"
peptide. As stated previously, without this bias, the
technique disclosed fails to identify any SH3 domain-binding
peptides.
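
To make the effect of the bias concrete, the following minimal Python sketch counts how many distinct peptides a template such as XXXPPXPXX can represent. It assumes, as the text states, that each X may be any amino acid other than cysteine (19 choices); the function name `library_size` is a hypothetical helper, and the comparison with a fully variable 9-mer is an illustration rather than a figure taken from the cited work.

```python
def library_size(template: str, alphabet_size: int = 19) -> int:
    """Number of distinct peptides matching a template.

    Each 'X' is a freely varying position drawn from `alphabet_size`
    residues (19 = the 20 standard amino acids minus cysteine); every
    other character is treated as a fixed residue.
    """
    return alphabet_size ** template.count("X")


biased = library_size("XXXPPXPXX")   # three prolines fixed, six positions vary
unbiased = library_size("X" * 9)     # hypothetical fully variable 9-mer

print(f"biased 9-mer library:   {biased:,} sequences")
print(f"unbiased 9-mer library: {unbiased:,} sequences")
print(f"reduction factor:       {unbiased // biased:,}")
```

Fixing three of nine positions shrinks the search space by a factor of 19 cubed, which is why a biased library is far easier to cover but cannot recover ligands that lack the fixed prolines.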

A consensus sequence, based on 13 binding peptides, was
suggested: RXLPPRPXX (SEQ ID NO:3), where X tends to be a
basic residue (like R, K or H). The binding affinities of
several SH3 binding peptides were disclosed as ranging from
8.7 to 30 µM. A "composite" peptide, RKLPPRPRR (SEQ ID
NO:4), was reported to have a binding affinity of 7.6 µM.
This value compares favorably to the binding affinity of the
peptide, VPPPVPPRRR (SEQ ID NO:5), to the N-terminal SH3
domain of Grb2. See, Kraulis, P.J. J. Appl. Crystallogr.
(1991) 24:946. Recognizing the limitations of their
technique, Chen and co-workers, supra, stated that their
results "illustrate the utility of biased combinatorial
libraries for ligand discovery in systems where there is some
general knowledge of the ligand-binding characteristics of
the receptor" (emphasis added).

Yu and co-workers, supra, further described an SH3
binding site consensus sequence, XpφPpXP (SEQ ID NO:6),
wherein X represents non-conserved residues, φ represents
hydrophobic residues, P is proline, and p represents residues
that tend to be proline. A consensus motif of RXLPPRPXX (SEQ
ID NO:7), where X represents any amino acid other than
cysteine, was proposed for ligands of PI-3' Kinase SH3
domain. A consensus motif of RXLPPLPRφ (SEQ ID NO:8), where
φ represents hydrophobic residues, was proposed for ligands
of Src SH3 domain. Still, the dissociation constants
reported for the 9-mer peptides ranged only from about 8-70
µM and selectivity between one type of SH3 domain and another
was relatively poor, the KDs differing by only about a factor
of four.

Hence, there remains a need to develop techniques for
the identification of Src SH3 binding peptides which do not
rely on such "biased" combinatorial peptide libraries that
are limited to a partially predetermined set of amino acid
sequences. Indeed, the isolation of SH3 binding peptides
from a "random" peptide library has not been achieved
successfully before now. Furthermore, particular peptides
having much greater binding affinities, whether general or
more selective binding for specific SH3 domains, remain to be
identified. Binding peptides specific for particular SH3
domains are useful, for example, in modulating the activity
of a particular SH3 domain-containing protein, while leaving
others bearing an SH3 domain unaffected. Still, the more
promiscuous general binding peptides are useful for the
modulation of a broad spectrum of SH3 domain-containing
proteins.

The present invention relates to such SH3 binding
peptides, methods for their identification, and compositions
comprising same. In particular, peptides comprising
particular sequences of amino acid residues are disclosed
which were isolated from random peptide libraries. In the
present invention, clones were isolated from a phage-
displayed random peptide library which exhibited strong
binding affinities for SH3 domain-containing protein targets.
Some of these protein targets include Abl, Src, Grb2, PLC-δ,
PLC-γ, Ras GAP, Nck, and p85 PI-3' Kinase. From the
nucleotide sequence of the binding phage, the amino acid
sequence of the peptide inserts has been deduced. Synthetic
peptides having the desired amino acid sequences are shown to
bind the SH3 domain of the target proteins. In particular,
synthetic peptides combining a core consensus sequence and
additional amino acid residues flanking the core sequence are
especially effective at binding to particular target protein
SH3 domains. The SH3 binding peptides disclosed herein can
be utilized in a number of ways, including the potential
modulation of oncogenic protein activity in vivo. These
peptides also serve as useful leads in the production of
peptidomimetic drugs that modulate a large class of proteins
involved in signal transduction pathways and oncogenesis.
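
The step of deducing a peptide insert from the nucleotide sequence of a binding phage can be illustrated with a short Python sketch. It assumes the standard genetic code and an insert that is already in frame; the function name `translate` and the example DNA string are hypothetical and chosen only so that the output spells a familiar motif. They are not sequences taken from this document.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string into one-letter amino acid code.

    Whitespace is ignored and trailing bases that do not complete a
    codon are dropped. Reading frame and strand are assumed correct.
    """
    dna = "".join(dna.upper().split())
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Hypothetical in-frame insert (illustrative only).
example_insert = "CGT CCG CTG CCG CCG CTG CCG"
print(translate(example_insert))
```

In practice the reading frame is fixed by the known flanking sequence of the phage coat protein gene, so only the insert region needs to be translated.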

3. Summary of the Invention

Accordingly, three phage-displayed random peptide
libraries were screened for isolates that bind to bacterial
fusion proteins consisting of the Src homology region 3 (SH3)
and glutathione S-transferase (GST). DNA sequencing of the
isolates showed that they contained sequences that resemble
the consensus motif, RPLPPLP (SEQ ID NO:9), within their 8,
22, or 36 amino acid long random regions. When peptides were
synthesized corresponding to the pIII inserts of the SH3-
binding phage, they bound to the GST fusions of the SH3
domains of Src and the Src-related proteins, such as Yes, but
not of Grb2, Crk, Abl, or PLCγ1. The synthesized peptides
bind quite well to the Src SH3 domain and act as potent
competitors of natural Src SH3 interactions in cell lysates.
For instance, these peptides can compete with radiolabelled
proteins from cell lysates in binding to immobilized Src-GST,
with an apparent IC50 of 1-10 µM. When a peptide bearing the
consensus sequence RPLPPLP (SEQ ID NO:9) was injected into
Xenopus laevis oocytes, it accelerated the rate of
progesterone-induced maturation. These results demonstrate
the utility of phage-displayed random peptide libraries in
identifying SH3-binding peptide sequences and that such
identified peptides exhibit both in vivo and in vitro
biological activity.
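
One simple way to see how a consensus motif emerges from sequenced isolates is to tabulate residues column by column. The Python sketch below does this for the ten 13-mers recited in claim 38 (SEQ ID NOS:85-94). It assumes the peptides are already in register as written, with no gaps, which is a simplification; the helper names are hypothetical and the procedure is an illustration, not the analysis performed by the inventors.

```python
from collections import Counter

# The ten 13-residue peptides recited in claim 38 (SEQ ID NOS:85-94).
PEPTIDES = [
    "RSTRPLPILPRTT", "STPRPLPMLPTTR", "STNRPLPMIPTTR", "RSTRPLPSLPITT",
    "STSRPLPSLPTTR", "RSTRSLPPLPPTT", "RSTRQLPIPPTTT", "STPRPLPLIPTTP",
    "RSTRPLPPTPLTT", "RSTRPQPPPPITT",
]


def column_counts(peptides):
    """Residue counts for each column of equal-length, ungapped peptides."""
    if len({len(p) for p in peptides}) != 1:
        raise ValueError("peptides must all have the same length")
    return [Counter(column) for column in zip(*peptides)]


def invariant_positions(peptides):
    """Map 0-based column index -> residue for columns with one residue only."""
    return {
        index: next(iter(counts))
        for index, counts in enumerate(column_counts(peptides))
        if len(counts) == 1
    }


for index, counts in enumerate(column_counts(PEPTIDES)):
    residue, n = counts.most_common(1)[0]
    print(f"column {index:2d}: most common {residue} ({n}/{len(PEPTIDES)})")
```

Columns where a single residue dominates mark the conserved core, while columns with many different residues correspond to the variable flanking positions.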

Thus, it is an object of the present invention to 
provide peptides having at least nine and up to forty-five
amino acid residues, including an amino acid sequence of the
formula, R-2-L-P-5-6-P-8-9 (SEQ ID NO:10), positioned
anywhere along the peptide, in which each number represents
an amino acid residue, such that 2 represents any amino acid
residue except cysteine, 5 and 6 each represents a
hydrophobic amino acid residue, 8 represents any amino acid
residue except cysteine, and 9 represents a hydrophilic amino
acid residue except cysteine, each letter being the standard
one-letter symbol for the corresponding amino acid, said
peptide exhibiting a binding affinity for the SH3 domain of
Src, provided that said peptide is not R-P-L-P-P-L-P-T-S (SEQ
ID NO:11). In a particular embodiment of the present
invention, the peptides also exhibit a binding affinity for
the SH3 domain of Src-related proteins, including Yes, Fyn, 
Lyn, Lck, Hck and Fgr. 
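
The formula R-2-L-P-5-6-P-8-9 maps naturally onto a regular
expression. The following Python sketch is illustrative only. It
assumes the hydrophobic set (F, Y, W, V, A, I, L, P, M) and the
hydrophilic set (K, R, H, D, E, N, Q, T, S) defined later in this
specification, does not model the optional treatment of glycine as
either class, and uses a hypothetical function name,
matches_seq_id_10. It tests sequence form only and says nothing
about actual binding affinity.

```python
import re

# Residue classes as defined later in the specification.
# Cysteine is left out of the hydrophilic set because position 9 excludes it.
HYDROPHOBIC = "FYWVAILPM"
HYDROPHILIC_NO_CYS = "KRHDENQTS"
ANY_NO_CYS = "ACDEFGHIKLMNPQRSTVWY".replace("C", "")

# R-2-L-P-5-6-P-8-9 (SEQ ID NO:10)
MOTIF = re.compile(
    f"R[{ANY_NO_CYS}]LP[{HYDROPHOBIC}]{{2}}P[{ANY_NO_CYS}][{HYDROPHILIC_NO_CYS}]"
)

EXCLUDED_9MER = "RPLPPLPTS"  # SEQ ID NO:11, excluded by the proviso


def matches_seq_id_10(peptide: str) -> bool:
    """Return True if the peptide is 9 to 45 residues long, contains the
    motif anywhere along its length, and is not the excluded 9-mer."""
    peptide = peptide.upper()
    if not 9 <= len(peptide) <= 45:
        return False
    if peptide == EXCLUDED_9MER:
        return False
    return MOTIF.search(peptide) is not None
```

Under these assumptions a 14-mer such as RSTPRPLPMLPTTR passes the
check, while the bare 7-mer RPLPPLP fails on length alone.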

The present invention also contemplates SH3 domain-
binding peptides that further comprise a C-terminal-flanking
amino acid sequence of the formula 10, 10-11, 10-11-12,
10-11-12-13 (SEQ ID NO:12) or 10-11-12-13-14 (SEQ ID NO:13), in
which each number represents any amino acid residue except
cysteine, such that 10 is bound to 9 by a peptide bond.
Furthermore, peptides are also provided which further
comprise an N-terminal-flanking amino acid sequence of the
formula 1', 2'-1', 3'-2'-1' or 4'-3'-2'-1' (SEQ ID NO:14) in
which each number represents any amino acid residue except
cysteine, such that 1' is bound to R by a peptide bond.

Thus, in a particular embodiment, a peptide is disclosed
having at least thirteen and up to forty-five amino acid
residues, including an amino acid sequence of the formula,
3'-2'-1'-R-2-L-P-5-6-P-8-9-10 (SEQ ID NO:15), positioned
anywhere along the peptide, in which each number represents
an amino acid residue, such that 3', 2', 1', 2, 8, and 10
each represents any amino acid residue except cysteine, 5 and
6 each represents a hydrophobic amino acid residue, and 9
represents a hydrophilic amino acid residue except cysteine,
each letter being the standard one-letter symbol for the
corresponding amino acid, said peptide exhibiting a binding
affinity for the SH3 domain of Src.

The present invention also seeks to provide new 
consensus sequences or motifs that reflect variations in SH3 
domain binding selectivities or specificities. The present 
invention also contemplates conjugates of the SH3 binding 

peptides and a second molecule or chemical moiety. This
second molecule may be any desired substance whose delivery
to the region of the SH3 domain of a particular protein (or

cell containing the protein) is sought. Possible target
cells include, but are not limited to, neural cells, immune
cells (e.g., T cells, B cells, natural killer cells, and the
like), osteoclasts, platelets, epidermal cells, and the like,
which cells express Src, Src-related proteins, and
potentially, other SH3 domain-containing proteins. In this
manner, the modulation of the biological activity of proteins 
bearing an SH3 domain can be accomplished. 

Other methods and compositions consistent with the 

objectives of the present invention are likewise disclosed.
In particular, a method is disclosed of modulating the 
activity of Src or Src-related proteins comprising 
administering a composition comprising an effective amount of 
a peptide of the present invention and a carrier, preferably 

a pharmaceutically acceptable carrier. In a specific

embodiment, the contemplated method results in the inhibition 
of the activity of Src or Src-related proteins. 
Alternatively, the method is effective to activate Src or 
Src-related proteins. 

In yet another embodiment, a method is disclosed of

identifying a peptide having a region that binds to an SH3 
domain comprising: (a) providing an immobilized target 
protein comprising an SH3 domain; (b) incubating the 
immobilized target protein with an aliquot taken from a 

random peptide library; (c) washing unbound library peptides
from the immobilized target protein; (d) recovering the 
peptide bound to the immobilized target protein; and (e) 
determining the primary sequence of the SH3 domain-binding 
peptide. 

Moreover, a method is disclosed of imaging cells,

tissues, and organs in which Src or Src-related proteins are 
expressed, which comprises administering an effective amount 
of a composition comprising an SH3 domain-binding peptide 
conjugated to a detectable label or an imaging agent.

Other objectives of the present invention will become
apparent to one of ordinary skill in the art after
consideration of the above disclosure and the following
detailed description of the preferred embodiments.

4. Brief Description of the Figures

FIG. 1 illustrates a scheme for the generation of a
random 36 amino acid peptide library (TSAR-9; e.g., SEQ ID
NO:16). Oligonucleotides were synthesized (SEQ ID NOS:17-18),
converted into double-stranded DNA, cleaved with
restriction enzymes (SEQ ID NOS:19-20), and cloned into the
M13 vector, m663. The random peptide region encoded by the
oligonucleotides is shown in the box (SEQ ID NO:16) and is
situated at the N-terminus of mature protein III (SEQ ID
NO:21).

FIG. 2 illustrates a scheme for the generation of a 
random 22 amino acid peptide library (TSAR-12; e.g., SEQ ID
NO:23). Oligonucleotides were synthesized (SEQ ID NOS:24-25),
converted into double-stranded DNA, cleaved with
restriction enzymes (SEQ ID NOS:26-27), and cloned into the
M13 vector, m663. The random peptide region encoded by the
oligonucleotides is shown in the box (SEQ ID NO:23) and is
situated at the N-terminus of mature protein III (SEQ ID 
NO:28). 

FIG. 3 illustrates a scheme for the generation of a
random 8 amino acid peptide library (R8C; SEQ ID NO:30).
Oligonucleotides were synthesized (SEQ ID NOS:31-32),
converted into double-stranded DNA, cleaved with restriction
enzymes (SEQ ID NOS:33-34), and cloned into the M13 vector,
m663. The random peptide region (SEQ ID NO:30) is flanked by
cysteine residues and is situated at the N-terminus of mature
protein III (SEQ ID NO:35).

FIG. 4 illustrates the possible origin of one class of 
double-insert R8C recombinants (e.g., encoding SEQ ID NO:36).
Double-stranded oligonucleotides (e.g., SEQ ID NO:37) may
have ligated in a head-to-head fashion at the Xba I site
prior to cloning in the Xho I-Xba I cleaved M13 vector.

FIG. 5 shows a list of random peptide recombinants (SEQ 
ID NOS:38-61 and 106) isolated by the method of the present
invention and the displayed peptide sequence. The amino acid
sequences are aligned to highlight the core sequences. The
flanking sequences are shown to the N-terminal and C-terminal
ends of the core sequence.

FIG. 6 graphically illustrates the relative binding
affinities of selected phage clones for various SH3 domains.
The results indicate that certain amino acid sequences
provide generic SH3 domain binding, while others can provide
greater selectivity for the SH3 domain of Src. Still other
clones exhibit Src SH3 domain preferential binding.

FIG. 7 shows the binding of synthetic peptides (SEQ ID 
NOS:9 and 62-70) representing Src SH3-selected phage inserts 
to Src SH3-GST fusion target (shaded columns) over background 
GST binding (unshaded columns) relative to the core peptide 
RPLPPLP (SEQ ID NO:9) and proline-rich peptide segments
derived from naturally occurring proteins. Bound
biotinylated peptide was detected with streptavidin-alkaline
phosphatase ELISA. Each point was performed in triplicate;
average absorbance at 405 nm is presented. Error bars
represent SD.

FIG. 8 illustrates the relative specificity of selected
peptides (SEQ ID NOS:9 and 62-70) for SH3 domains derived
from different proteins. In particular, the binding
affinities of the peptides for the SH3 domains of the
following protein fusion targets were tested: Src SH3-GST,
Yes SH3-GST, Grb2-GST, Crk SH3-GST, Abl SH3-GST, PLCγ1
SH2SH3-GST. Bound biotinylated peptide was detected with
streptavidin-alkaline phosphatase. Each point was performed
in triplicate; values are average signal (absorbance at 405
nm) above GST background, with error bars representing

standard deviation. Hatched bars indicate saturation of the 
ELISA signal. 

FIG. 9 presents the results of competition experiments 
in which selected peptides were found to inhibit the binding 
of proteins from cell lysates to immobilized Src SH3-GST or
Abl SH3-GST protein fusion targets.

FIG. 10 presents a graph illustrating the increased rate
of progesterone-induced maturation of oocytes injected with
an SH3 domain-binding peptide, VLKRPLPIPPVTR (SEQ ID NO:64),
of the present invention. Briefly, Stage VI oocytes were
prepared and injected as previously described (see, Kay,
B.K., in Methods in Cell Biol. (1991) 36:663-669). Oocytes
were injected with 40 nL of 100 μM test peptide or water.
After injection, the oocytes were placed in 2 μg/mL
progesterone (Sigma, St. Louis, MO) and scored hourly for
germinal vesicle breakdown (GVBD).

FIG. 11 shows the results of fluorescence experiments in 
which certain peptides, Panel A = VLKRPLPIPPVTR (SEQ ID
NO:64), Panel B = GILAPPVPPRNTR (SEQ ID NO:63), Panel C =
RSTPRPLPPLPTTR (SEQ ID NO:67), of the invention were shown to
localize within cellular compartments thought to contain Src
or Src-related proteins. 

FIG. 12 illustrates a scheme for the generation of a 
biased peptide library. Oligonucleotides were synthesized,
converted into double-stranded DNA, cleaved with restriction
enzymes XhoI and XbaI, and cloned into the mBAX vector,

described further below in the Examples section. The biased 
peptide region is situated at the N-terminus of mature pIII 
protein. 

FIG. 13 illustrates the peptide sequence encoded in the 
mBAX vector situated at the N-terminus of mature pIII
protein. 

5. Detailed Description of the Preferred Embodiments 
5.1. General considerations 

The present invention relates to peptides that
exhibit a binding affinity for an SH3 domain, which domain
has been found to be present in a number of physiologically
significant proteins. In particular, peptides are disclosed
which exhibit general binding characteristics to the SH3
domains found in a group of proteins, including but not
limited to Abl, Src, Grb2, PLC-δ, PLC-γ, Ras GAP, Nck, and
p85 PI-3' Kinase. Preferred peptides exhibit selective, if
not specific, binding affinity for the SH3 domain of Src. As
described herein, the peptides of the present invention
include a core sequence, preferably a consensus sequence, and
additional amino acid residues that flank the core sequence.
These peptides, including the methods for their
identification, are described in greater detail, below.

Thus, in a specific embodiment of the invention, 
peptides are provided which have at least nine and up to 
about forty-five amino acid residues, including an amino acid 
sequence resembling the formula,

R-2-L-P-5-6-P-8-9 (SEQ ID NO:10),

positioned anywhere along the peptide. In the above-
mentioned formula, each number represents an amino acid
residue, such that 2 represents any amino acid residue except
cysteine, 5 and 6 each represents a hydrophobic amino acid
residue, 8 represents any amino acid residue except cysteine,
and 9 represents a hydrophilic amino acid residue except
cysteine. Each letter used in the formulas herein represents
the standard one-letter symbol for the corresponding amino
acid. When the peptide is a 9-mer, the peptide
R-P-L-P-P-L-P-T-S (SEQ ID NO:11) is excluded. The peptides
of particular interest are those that exhibit a binding
affinity for the SH3 domain of Src and Src-related proteins,
including Yes, Fyn, Lyn, Lck, Hck and Fgr. Preferably, the
peptides of the invention exhibit a binding affinity for the
SH3 domain of Src, which is at least three-fold, more
preferably at least four-fold, most preferably at least about
five-fold greater than that exhibited by the peptide RPLPPLP
(SEQ ID NO:9). In still other embodiments, the peptides
exhibit a binding affinity for the SH3 domain of Src which is
at least ten-fold greater than that exhibited by the peptide
RPLPPLP (SEQ ID NO:9).

In specific embodiments, peptides are disclosed in which 
the various amino acid residues at the indicated positions 
may independently have the following preferred identities: 2
is a P, R, A, L, Q, E or S, more preferably P or R; 5
represents a P, M, I or L, more preferably P or M; 6 is a P,
I or V, more preferably P or L; 8 is a T, R, P, I, N, E,
V, S, A, G or L, more preferably T or R; and 9 is a T, R, S,
H or D, more preferably T or R. Despite the preference for
hydrophobic amino acid residues at 5 and 6, in some cases it
may be desirable to have hydrophilic amino acid residues at
these positions. Specifically, amino acid residue 5 may be a
T, R or S, and amino acid residue 6 may be a T or R.
Likewise, while a hydrophilic amino acid residue is preferred
at position 9, in some instances a hydrophobic residue, such
as a P or A, may be desirable.
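
One way to make these preferences concrete is a lookup table keyed
by motif position. The Python sketch below records only the "more
preferably" residues stated above for positions 2, 5, 6, 8 and 9,
and counts how many of them a candidate 9-residue window satisfies.
The names MORE_PREFERRED and count_preferred are hypothetical, and
the count is a bookkeeping aid, not a predictor of affinity.

```python
# Fixed residues of R-2-L-P-5-6-P-8-9, by 1-based position.
FIXED = {1: "R", 3: "L", 4: "P", 7: "P"}

# "More preferably" residues quoted in the text, by 1-based position.
MORE_PREFERRED = {2: "PR", 5: "PM", 6: "PL", 8: "TR", 9: "TR"}


def count_preferred(window: str) -> int:
    """Count the variable positions of a 9-residue window that carry a
    'more preferred' residue. Raises ValueError if the window does not
    have the fixed R, L, P, P residues in place."""
    if len(window) != 9:
        raise ValueError("window must be exactly 9 residues")
    for pos, residue in FIXED.items():
        if window[pos - 1] != residue:
            raise ValueError(f"position {pos} must be {residue}")
    return sum(
        window[pos - 1] in allowed for pos, allowed in MORE_PREFERRED.items()
    )
```

A window that still fits the broader lists, for example one with A at
position 2, simply scores lower; it is not rejected.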

The present invention also contemplates SH3 domain-
binding peptides with a minimum length of 10, 11, 12, 13, 14,
15 or more amino acids. Such peptides contain additional
amino acid residues flanking the core sequence of
R-2-L-P-5-6-P (SEQ ID NO:71) either at the C-terminal end,
the N-terminal end or both. Thus, for example, such peptides
include those that further comprise a C-terminal-flanking
amino acid sequence of the formula 10, 10-11, 10-11-12,
10-11-12-13 (SEQ ID NO:12) or 10-11-12-13-14 (SEQ ID NO:13), in
which each number represents any amino acid residue except
cysteine, such that the amino acid residue 10 is bound to the
amino acid residue 9 by a peptide bond. In that case,
specific embodiments include an amino acid residue 10 which
is T, R, L, S, D, P, A or N, preferably T or R, an amino acid
residue 11 which is R, P, A, Q, S or T, preferably R or P, an
amino acid residue 12 which is P, S, R or T, preferably P or
S, an amino acid residue 13 which is P, S, R, F, H or T,
preferably P or S, and an amino acid residue 14 which is S,
R, G or T, preferably S or R.

Furthermore, peptides are also provided which further
comprise an N-terminal-flanking amino acid sequence of the
formula 1', 2'-1', 3'-2'-1' or 4'-3'-2'-1' (SEQ ID NO:14) in
which each number represents any amino acid residue except
cysteine, such that 1' is bound to R by a peptide bond. In
such a case, specific embodiments are provided in which the
amino acid residue 1' is T, P, S, N, F, W, K, H, Q or G,
preferably T or P, wherein the amino acid residue 2' is S, T,
G, P, R, Q, A or H, preferably S or T, wherein the amino
acid residue 3' is R, S, P, G, A, V, Y or L, preferably S or
T, and wherein the amino acid residue 4' is R, S, V, T, G, L
or F, preferably R or S.

In a particular embodiment, a peptide is disclosed
having at least thirteen and up to forty-five amino acid
residues, including an amino acid sequence of the formula,
3'-2'-1'-R-2-L-P-5-6-P-8-9-10 (SEQ ID NO:15), positioned
anywhere along the peptide, in which each number represents
an amino acid residue, such that 3', 2', 1', 2, 8, and 10
each represents any amino acid residue except cysteine, 5 and
6 each represents a hydrophobic amino acid residue, and 9
represents a hydrophilic amino acid residue except cysteine,
each letter being the standard one-letter symbol for the
corresponding amino acid, said peptide exhibiting a binding
affinity for the SH3 domain of Src. Preferred 13-mers
include, but are not limited to, those having an amino acid
residue 5 which is a P or M, an amino acid residue 1' which
is T, P, S or N, an amino acid residue 2' which is S or T, an
amino acid residue 3' which is R or S, and an amino acid
residue 10 which is T or R. In all the SH3 domain-binding
peptides described herein, the prohibition against the use of
the hydrophilic amino acid residue cysteine (C) does not
extend beyond the 7-mer "core" sequence and the additional
amino acid residues flanking the core up to a total (core +
flanking) of about 20 amino acids. That is, the occasional
use of a cysteine is not absolutely prohibited. What should
be kept in mind is that the potential for the formation of
intramolecular disulfide bonds, to form a cyclic structure,
be minimized as much as possible. Applicants have found that
cyclized structures appear to be disfavored, at least with
potential binding peptides of less than about 15 amino acid
residues in length. The concern for the formation of
cyclized structures comprising the core sequence diminishes
with increasing size of the peptide. Presumably, a large
enough structure, though cyclic, may allow the critical core
sequence to adopt a more or less linear conformation.

In particular, specific peptides are disclosed which
exhibit binding affinities to SH3 domains. These include the
peptides, RSTPRPLPMLPTTR (SEQ ID NO:62), RSTPRPLPPLPTTR (SEQ
ID NO:67), GILAPPVPPRNTR (SEQ ID NO:63), VLKRPLPIPPVTR (SEQ
ID NO:64), GPHRRLPPTPATR (SEQ ID NO:65), and ANPSPATRPLPTR
(SEQ ID NO:66).
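
The cysteine guidance given earlier in this section can be applied
to the specific peptides just listed. The following Python sketch,
built around a hypothetical helper may_cyclize, flags any sequence
carrying two or more cysteines, the minimum needed for an
intramolecular disulfide bond. It is a simple residue count under
that assumption, not a structural prediction.

```python
# Specific peptides named in the text, keyed by SEQ ID NO.
PEPTIDES = {
    62: "RSTPRPLPMLPTTR",
    67: "RSTPRPLPPLPTTR",
    63: "GILAPPVPPRNTR",
    64: "VLKRPLPIPPVTR",
    65: "GPHRRLPPTPATR",
    66: "ANPSPATRPLPTR",
}


def may_cyclize(peptide: str) -> bool:
    """True if the peptide has at least two cysteines and so could, in
    principle, form an intramolecular disulfide bond."""
    return peptide.upper().count("C") >= 2


for seq_id, seq in sorted(PEPTIDES.items()):
    status = "possible disulfide" if may_cyclize(seq) else "no Cys pair"
    print(seq_id, seq, len(seq), status)
```

None of the six listed sequences contains a cysteine, so none is
flagged.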

Phage clones are also disclosed, along with the amino 
acid sequences that are responsible for SH3 domain binding. 
These phage clones are identified in Figure 5.

In other embodiments of the present invention, SH3
domain-binding peptides are contemplated which have a total
of 11, 13, 14, 18, 20, 22, 23, 25, 30, 36, 38 or 45 amino 
acid residues. 

The peptides of the present invention, having been
disclosed herein, may be prepared by any number of
practicable methods, including but not limited to solution-
phase synthesis, solid-phase synthesis, protein expression by
a transformed host, cleavage from a naturally-derived,
synthetic or semi-synthetic polypeptide, or a combination of
these techniques.

The SH3 binding peptides exhibit a wide range of 
biological activity which includes the enhancement (or 
inhibition, depending on the particular peptide or the nature 
of the peptide's target molecule, in this case a protein 
bearing an SH3 domain) of the natural function or biological
activity of the peptide's target molecule. For example, the 
interaction of the binding peptide of the present invention 
could result in the modulation of the oncogenic activity of 
the target molecule bearing the SH3 domain. If the target 
molecule has, in turn, a natural binding partner or ligand,
the peptides of the present invention may also exhibit 
antagonistic or agonistic activity in relation to the 
biological activity of the natural binding partner. 

Thus, it is an object of the present invention to 
provide a method of activating Src or Src-related protein
tyrosine kinases by administering an effective amount of the
SH3 domain-binding peptides generally described herein. The
intensity of the immune response can thus be stimulated, for
example, by the increased production of certain lymphokines,
such as TNF-alpha and interleukin-1. As is generally known
to those of ordinary skill in the art, a more intense immune
response may be in order in certain conditions, such as in
combating a particularly tenacious infection, viral or 
otherwise, or a malignancy. 

Furthermore, in a specific embodiment of the present 
invention, a conjugate compound is contemplated which 

comprises the peptide of the present invention and a second
chemical moiety. The second chemical moiety can be selected
from a wide variety of chemical compounds including the
peptide itself. Typically, however, the second chemical
moiety is selected to be other than the peptide of the
present invention, including but not limited to an amino
acid, a peptide other than an SH3 binding peptide of the
present invention, a polypeptide or protein (i.e., the
conjugate is a fusion protein), a nucleic acid, a nucleoside,
a glycosidic residue (i.e., any sugar or carbohydrate), a
label or image-enhancing agent (including metals, isotopes,
radioisotopes, chromophores, fluorophores (such as FITC,
TRITC, and the like), and enzyme substrates), a drug
(including synthetic, semisynthetic, and naturally-occurring
compounds), small molecules (e.g., biotin, hormones, factors)
and the like.

The peptide of the present invention can be conjugated 
to the second chemical moiety either directly (e.g., through 
appropriate functional groups, such as an amine or carboxylic 
acid group to form, for example, an amine, imine, amide, 

ester, acyl or other carbon-carbon bond) or indirectly
through the intermediacy of a linker group (e.g., an
aliphatic or aromatic polyhydroxy, polyamine, polycarboxylic
acid, polyolefin or appropriate combinations thereof).
Moreover, the term "conjugate," as used herein, is also meant
to encompass non-covalent interactions, including but not
limited to ionic, affinity or other complexation
interactions. Preferably, such other non-covalent
interactions provide definable, most preferably, isolatable
chemical conjugate species.

As described further herein, the peptides of the present
invention have been shown to localize within certain cellular
compartments which contain Src or Src-related proteins.

Consequently, the above-described conjugate can be utilized 
as a delivery system for introduction of a drug to cells, 
tissues or organs that include SH3 domain-containing 
proteins. 

It should also be pointed out that the present invention

seeks to provide a recombinant construct comprising a nucleic 
acid or its complement that includes codons or nucleotide 
sequences encoding a peptide having a region that binds to an 
SH3 domain, preferably the Src SH3 domain. The recombinant 

nucleic acid may be a DNA or RNA polynucleotide.

In a specific embodiment, the present invention 
contemplates a recombinant construct which is a transforming 
vector. Such vectors include those well known to those of
ordinary skill in the art, which effect the transfer or
expression of the nucleotide sequence after introduction to a
host, such as recombinant plasmid, phage or yeast artificial
chromosome. These vectors may be closed circular loops or
they may be linearized. The vectors contemplated include
those that exist extrachromosomally after host transformation
or transfection, as well as those that integrate within or
even displace portions of the host chromosome. The vectors
may be introduced to the cell with the help of transfection
aids or techniques well-known in the art. For example, these
aids or techniques may take the form of electroporation, use
of calcium chloride, calcium phosphate, DEAE dextran,

liposomes or polar lipid reagents known as LIPOFECTIN or 
LIPOFECTAMINE. In addition, the present invention 
contemplates the direct introduction of the desired nucleic 
acid to the host cell, for instance, by injection. 

Transformed host cells are also obtained by the methods
of the present invention which are capable of reproducing the
polynucleotide sequences of interest and/or expressing the
corresponding peptide products. A variety of hosts are
contemplated, including prokaryotic and eukaryotic hosts. In
particular, bacterial, viral, yeast, animal, and plant cells
are potentially transformable hosts. Thus, a method is
disclosed to obtain a transformed host cell that can produce,
preferably secrete, a peptide having a region that binds to
an SH3 domain comprising (a) providing an expression vector,
preferably a secretory expression vector, comprising a
nucleotide sequence encoding at least one copy of a peptide
having a region that binds to an SH3 domain; and (b)
introducing the vector to a competent host cell. 

The peptides, thus produced, may then be introduced to 
cells, tissues, organs, or administered to the subject for 
the purpose of modulating the biochemical activity of the SH3 

domain-containing proteins present therein. Accordingly, in
specific embodiments of the present invention, compositions 
are provided which comprise an SH3 domain-binding peptide, 
including a core sequence and flanking sequences, and a 
suitable carrier. 

The compositions contemplated by the present invention

may also include other components, from those that facilitate 
the introduction or administration of the compositions to 
those that have their own innate activity, such as a 
prophylactic, a diagnostic or a therapeutic action. Such 

innate activity may be distinct from that of the peptides of
the present invention or be complementary thereto. In any 
event, the compositions of the present invention include 
those that are suitable for administration into mammals, 
including humans. Preferably, the compositions (including 

necessarily the carrier) of the present invention are
sterile, though others may need only be cosmetically, 
agriculturally or pharmaceutically acceptable. Still other 
compositions may be adapted for veterinary use. 

The compositions, including the drug delivery systems 
described herein, are contemplated to be administered in a
variety of ways, such as parenterally, orally, enterally,
topically or by inhalation. The compositions may also be
administered intranasally, ophthalmically or intravaginally.
Furthermore, compositions of the invention can take
several forms, such as solids, gels, liquids, aerosols or
patches.

In another embodiment of the present invention a method
is provided of identifying a peptide having a region that
binds to an SH3 domain comprising: (a) providing an
immobilized target protein comprising an SH3 domain; (b)
incubating the immobilized target protein with an aliquot
taken from a phage-displayed random peptide library, which
library includes peptides having a random sequence of ≥8
amino acid residues; (c) washing unbound phage from the
immobilized target protein; (d) recovering the phage bound to
the immobilized target protein; and (e) determining the
relevant nucleotide sequence of said binding phage nucleic
acid and deducing the primary sequence corresponding to the
SH3 domain-binding peptide. Preferably, the method further
comprises amplifying the titer of the recovered phage and
repeating the steps of incubation, washing and recovery to
provide SH3 domain-binding peptide-enriched phage.
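
The effect of repeating the incubation, washing and recovery steps
can be illustrated numerically. The Python sketch below is a toy
model, not a description of the applicants' protocol. It assumes
that a fixed fraction of binding phage and a much smaller fraction
of non-binding phage survive each wash, and that amplification
preserves their ratio. The function name, the retention values and
the starting frequency are all invented for illustration.

```python
def binder_fraction(initial_fraction: float,
                    binder_retention: float,
                    background_retention: float,
                    rounds: int) -> list[float]:
    """Fraction of binding phage in the pool after each panning round.

    Amplification is assumed to scale binders and non-binders equally,
    so only the per-round retention ratio changes the composition.
    """
    fractions = []
    binders = initial_fraction
    others = 1.0 - initial_fraction
    for _ in range(rounds):
        binders *= binder_retention      # binders that survive the wash
        others *= background_retention   # non-binders that survive the wash
        total = binders + others
        binders, others = binders / total, others / total  # renormalise
        fractions.append(binders)
    return fractions


# Hypothetical numbers: 1 binder per million phage; 10% of binders and
# 0.01% of non-binders recovered in each round.
for n, f in enumerate(binder_fraction(1e-6, 0.1, 1e-4, 3), start=1):
    print(f"round {n}: {f:.4f}")
```

With these assumed values each round improves the odds of a binder by
the ratio of the two retention fractions, which is why a few rounds
can be enough to bring a rare clone to dominance.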

Any other mode by which the peptide library, random or 
otherwise, can be "displayed" can be utilized in the present 
invention, however. Moreover, the present applicants believe 
that longer random peptide sequences (e.g., >6 amino acid 

residues, preferably >10, and most preferably, >12) provide
not only much greater diversity but also a richer degree of 
secondary structure conducive to binding activity. If the 
random region of the peptide is less than or equal to an 8- 
mer, it should preferably not be cyclized. 
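
The gain in diversity with insert length follows from simple
counting: a fully random insert of n residues drawn from the 20
standard amino acids has 20^n possible sequences. The Python sketch
below tabulates this for the three random-region lengths used in
this work (8, 22 and 36 residues). The function name is
hypothetical.

```python
def sequence_space(length: int, alphabet_size: int = 20) -> int:
    """Number of distinct peptides of the given length."""
    return alphabet_size ** length


for n in (8, 22, 36):
    print(f"{n:>2} residues: {sequence_space(n):.3e} possible sequences")
```

For the longer inserts this theoretical space is far larger than any
physical library can sample, so a library covers only a small part
of it.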


5.2. Preparation of Random Peptide Libraries 

The preparation and characterization of the
preferred phage-displayed random peptide libraries have been
described elsewhere. See, for example, Kay, B.K. et al. in
Gene (1992) 128:59-65, for a description of the preparation
of the phage-displayed random peptide library known as TSAR-9,
more below. In particular, by cloning degenerate
oligonucleotides of fixed length into bacteriophage vectors,
recombinant libraries of random peptides can be generated
which are expressed at the amino-terminus of the pIII protein
on the surface of M13 viral particles. (There are 3-5 copies
of the pIII-fusion on the surface of each particle.) Phage
display offers several conveniences: first, the expressed
peptides are on the surface of the viral particles and
accessible for interactions; second, the recombinant viral
particles are stable (i.e., can be frozen, exposed to pH
extremes); third, the viruses can be amplified; and fourth,
each viral particle contains the DNA encoding the recombinant
genome. Consequently, these libraries can be screened by
isolating viral particles that bind to targets. The isolates
can be grown up overnight, and the displayed peptide sequence
responsible for binding can be deduced by DNA sequencing.
These libraries have approximately >10* different
recombinants, and nucleotide sequencing of the inserts
suggests that the expressed peptides are indeed random in
amino acid sequence. These libraries are referred to herein
as TSAR libraries, where TSAR stands for Totally Synthetic
Affinity Reagents. The preparation of the TSAR libraries is
described further below.
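
Deducing the displayed peptide from a sequenced insert is a direct
translation with the standard genetic code. The Python sketch below
is a minimal illustration of that step. The nucleotide sequence in
the usage line is a made-up example chosen to encode RPLPPLP; it is
not taken from any clone described here, and the function name is
hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Hypothetical insert, written here only to illustrate the lookup.
print(translate("CGTCCGCTGCCGCCGCTGCCG"))
```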

5.3. SH3 Binding Clones And Their Characteristics

Accordingly, peptides have been isolated from an

unconstrained random peptide library which exhibit a binding 
affinity for SH3 domains. Furthermore, the binding 
affinities exhibited by the disclosed peptides differ in 
their selectivities with certain peptides showing comparable 

binding affinities for SH3 domains derived from different
proteins, while others manifest greater affinities for 
specific SH3 domains. 

The amino acid sequences of various peptides isolated by
the present method are listed in Figure 5. As can be seen
from this list, certain groups of SH3 domain binding peptides
are isolated from three separate random peptide libraries,
each based on a different type of random peptide insert, all
displayed at the amino-terminus of the pIII protein on the
surface of M13 viral particles. Ten clones were isolated
from the R8C library, seven from the TSAR-12 library, and
seven from the TSAR-9 library. The sequences are presented
to highlight the particular amino acid residues believed to
bind directly to the SH3 domain, as well as to point out the
remaining amino acid residues of the random insert and the
viral flanking sequences and complementary site amino acid
residues common to each group of clones. The frequency with
which each particular clone is found in each library is also
indicated in Figure 5. Thus, clones T12.SRC3.1 and
T12.SRC3.2 are by far the most abundant clones found among 
the three libraries. 

Interestingly, all the binding peptides are found to 
have the proline-rich amino acid residue motif, which is
apparently responsible for binding, the motif being located 
predominantly at the C-terminal end of the insert, although 
each clone also contains an insert at the N-terminal end. 
The significance of this observation is not presently 
understood, although this finding may indicate the possible
importance of the C-terminal viral flanking sequences in SH3 
domain binding. 

Indeed, a synthetic peptide bearing only the core
consensus sequence RPLPPLP (SEQ ID NO:9) was less effective
in binding to target SH3 domains than synthetic peptides that
also included additional amino acid residues flanking the
core sequences. Thus, 13-mers and 14-mers having the
sequences RSTPRPLPMLPTTR (SEQ ID NO:62), RSTPRPLPPLPTTR (SEQ
ID NO:67), GILAPPVPPRNTR (SEQ ID NO:63), GPHRRLPPTPATR (SEQ
ID NO:65), and VLKRPLPIPPVTR (SEQ ID NO:64) have been
prepared and shown to bind to SH3 domains, such as those of
Src and Yes, much more avidly than the 7-mer, RPLPPLP (SEQ ID
NO:9). The 13-mer ANPSPATRPLPTR (SEQ ID NO:66) has been
shown to have binding affinities comparable to the core
consensus sequence. In each case, the 13-mers comprise a
7-mer "core" sequence plus additional amino acid residues
flanking same, some of which additional amino acid residues
are contributed by the viral flanking sequences.

Thus, in one embodiment of the present invention, a 7-mer
core includes a consensus motif of the formula RXLPφφP
(SEQ ID NO:71), wherein R is arginine, L is leucine, P is
proline, X represents any amino acid except cysteine and φ
represents a hydrophobic amino acid residue. By "hydrophobic
amino acid residue," the applicants mean to include F, Y, W,
V, A, I, L, P or M, each letter representing the standard
one-letter designation for the corresponding amino acid
residue.

Furthermore, a preferred 9-mer peptide comprising two
additional amino acids on the C-terminal end of the core
sequence is envisioned having a consensus motif of the
formula RXLPφφPXψ. In this preferred 9-mer consensus motif,
the symbol ψ represents a hydrophilic amino acid residue,
except cysteine. By "hydrophilic amino acid residue," the
applicants mean to include K, R, H, D, E, N, Q, T, S or C,
and the other symbols are as defined above. For the purposes
of the present invention, a glycine residue (G) may be
considered either a hydrophobic or a hydrophilic amino acid
residue. The one-letter symbols B and Z, which stand for N
or D and Q or E, respectively, are considered hydrophilic
amino acid residues.
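
By way of illustration only, the 7-mer and 9-mer consensus motifs defined above may be written as pattern searches over one-letter peptide sequences. The following Python sketch is not part of the disclosed method; the names `CORE_7MER`, `NINE_MER` and `find_motif` are assumptions introduced here, and the residue classes simply transcribe the definitions given in the text (G counted in both classes, B and Z counted as hydrophilic, cysteine excluded where the text excludes it).

```python
import re

# Residue classes transcribed from the definitions in the text.
HYDROPHOBIC = "FYWVAILPMG"             # hydrophobic residues, with G permitted
HYDROPHILIC_NO_CYS = "KRHDENQTSBZG"    # hydrophilic residues, cysteine excluded
ANY_NO_CYS = "ABDEFGHIKLMNPQRSTVWYZ"   # X: any residue except cysteine

# 7-mer core: R X L P (hydrophobic) (hydrophobic) P
CORE_7MER = re.compile(f"R[{ANY_NO_CYS}]LP[{HYDROPHOBIC}]{{2}}P")

# 9-mer: the 7-mer core followed by X and one hydrophilic residue
NINE_MER = re.compile(
    f"R[{ANY_NO_CYS}]LP[{HYDROPHOBIC}]{{2}}P[{ANY_NO_CYS}][{HYDROPHILIC_NO_CYS}]"
)


def find_motif(peptide, pattern):
    """Return the first stretch of `peptide` matching `pattern`, or None."""
    match = pattern.search(peptide.upper())
    return match.group(0) if match else None
```

Under these assumptions, `find_motif("RSTPRPLPPLPTTR", CORE_7MER)` locates the core RPLPPLP, whereas GILAPPVPPRNTR yields no match, which agrees with the statement elsewhere in the specification that this peptide does not closely adhere to the preferred core consensus sequence.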

Particular 13-mer peptides of the present invention
include those listed below. It is noted, however, that not
all the following 13-mer peptides correlate strictly to or
comply with the preferred 9-mer consensus motif described
above. Those peptides that do not comply (indicated in
italics, with the non-complying amino acid residues
underscored) can, thus, be described as "resembling" those
that do comply (indicated in normal type) with the preferred
9-mer consensus motif: PGFRELPPLPPSR (SEQ ID NO:72),
AQSRPLPIPPETR (SEQ ID NO:73), VLKRPLPIPPVTR (SEQ ID NO:64), 
PPN^PLPPLPTHL (SEQ ID NO:74), TGRGPLPPLPNDS (SEQ ID NO:75),
YSTRPVPPITRRS (SEQ ID NO:76), SHK^RLPPLPTRP (SEQ ID NO:77),
YRFRALP&PPSAS (SEQ ID NO:78), GPHRRLPPTPATR (SEQ ID NO:65),
LAQRQLPP^PGRD (SEQ ID NO:79), ALQRRLPBXPPEA (SEQ ID NO:80),
PATRPLPXBPSRT (SEQ ID NO:81), YSTRPLPSEPSRT (SEQ ID NO:82),
XPGRILSiLPS.EE.R (SEQ ID NO:83), SGGLLAPPVPPRH (SEQ ID NO:84),
RSTRPLPILPRTT (SEQ ID NO:85), STPRPLPMLPTTR (SEQ ID NO:86),
STNRPLPMIPTTR (SEQ ID NO:87), RSTRPLP^LPITT (SEQ ID NO:88),
STSRPLPSLPTTR (SEQ ID NO:89), RSTRSLPPLPPTT (SEQ ID NO:90),
RSTRQLPIPPTTT (SEQ ID NO:91), STPRPLPLIPTTP (SEQ ID NO:92),
RSTRPLPPTPLTT (SEQ ID NO:93), and RSTRPQPPPPITT (SEQ ID
NO:94). Accordingly, other peptides not specifically 
disclosed, which either comply with or "resemble" the
preferred 9-mer consensus motif, can be readily envisioned by
those of ordinary skill in the art and are considered to be
equivalent to those that are specifically disclosed above.
In particular, non-compliance at positions 1 (S, G, and I, in
place of R, are tolerated), 3 (V, A, and Q, in place of L,
are tolerated), 4 (L, in place of P, is tolerated), 5
(hydrophilic amino acid residues, S, R, and T, are tolerated
in place of a hydrophobic amino acid residue), 6 (hydrophilic
amino acid residues, R and T, are tolerated in place of a
hydrophobic amino acid residue), 7 (T and S, in place of P,
are tolerated), and 9 (P and A are tolerated in place of a
hydrophilic amino acid residue) has been observed.
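
The distinction drawn above between peptides that "comply with" and peptides that "resemble" the preferred 9-mer consensus motif can be sketched, purely as an illustration, as a position-by-position check. The Python fragment below is an assumption-laden sketch rather than a disclosed procedure: the function name `classify_nine_mer`, the label "neither", and the choice to examine a single 9-residue window are all introduced here, while the tolerated substitutions are transcribed from the preceding paragraph.

```python
# Residue classes as defined in the specification (G allowed in both classes,
# B and Z treated as hydrophilic, cysteine excluded where the text excludes it).
HYDROPHOBIC = set("FYWVAILPMG")
HYDROPHILIC_NO_CYS = set("KRHDENQTSBZG")
ANY_NO_CYS = set("ABDEFGHIKLMNPQRSTVWYZ")

# Allowed residues at positions 1-9 of the preferred 9-mer consensus motif.
CONSENSUS = [
    set("R"), ANY_NO_CYS, set("L"), set("P"),
    HYDROPHOBIC, HYDROPHOBIC, set("P"),
    ANY_NO_CYS, HYDROPHILIC_NO_CYS,
]

# Non-complying residues reported as tolerated, keyed by 1-based position.
TOLERATED = {
    1: set("SGI"), 3: set("VAQ"), 4: set("L"),
    5: set("SRT"), 6: set("RT"), 7: set("TS"), 9: set("PA"),
}


def classify_nine_mer(window):
    """Classify a 9-residue window; return (label, deviating positions)."""
    if len(window) != 9:
        raise ValueError("expected a 9-residue window")
    deviations = [
        pos
        for pos, (residue, allowed) in enumerate(zip(window, CONSENSUS), start=1)
        if residue not in allowed
    ]
    if not deviations:
        return "complies", deviations
    if all(window[pos - 1] in TOLERATED.get(pos, set()) for pos in deviations):
        return "resembles", deviations
    return "neither", deviations
```

For example, a window with serine at position 5 (RPLPSLPTT) is reported as resembling the motif with a single deviation at position 5, since serine is among the substitutions observed to be tolerated there.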

5.3.1. Binding specificities 

It has been discovered that certain of the
binding peptides disclosed have a greater relative binding
affinity for one SH3 domain over another. Referring now to
Figure 8, the relative binding affinities of the various
peptides described above toward different SH3 domain targets
are graphically presented. As one can see, the relative
binding affinities of the respective peptides can differ by
orders of magnitude. Thus, as shown in Figure 8, the peptide
GPHRRLPPTPATR (SEQ ID NO:65), having the relevant sequence of
the phage clone identified as T12.SRC3.3, is specific to Src
family SH3 domains, including, but not limited to, Src, Yes,
Lck, Hck, Fgr, Fyn, and Lyn. This SH3 binding peptide has
little affinity for SH3 domains derived from PLCγ or Grb2.
On the other hand, the peptide GILAPPVPPRNTR (SEQ ID NO:63),
corresponding to the relevant sequence of the phage clone
T12.SRC3.1, which is one of the most abundant binding clones
found by the present method, binds generically to a broad
range of SH3 domains, including Src, PLCγ, and Grb2.

On an intermediate level, the present invention has also
uncovered a peptide, VLKRPLPIPPVTR (SEQ ID NO:64),
corresponding to the relevant sequence of the phage clone
T12.SRC3.6, which is Src preferential; that is, this peptide
exhibits strong binding affinities for members of the Src
family, some binding affinities for Grb2 proteins, but little
binding affinity for PLCγ domains. The peptide
ANPSPATRPLPTR (SEQ ID NO:66), corresponding to the relevant
sequence of the phage clone T12.SRC3.2, also exhibits Src
family specificity similar to GPHRRLPPTPATR (SEQ ID NO:65).
The peptides RSTPRPLPMLPTTR (R8C.YES3.5; SEQ ID NO:62) and
RSTPRPLPPLPTTR (representative consensus motif; SEQ ID NO:67)
are highly specific for SH3 domains of Src, Yes, and other
Src-related proteins.
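
As an aid to the reader, the specificity classes described in this section can be collected in a small lookup table. The Python sketch below is illustrative only; the record type `PeptideRecord`, the helper `by_specificity`, and the three class labels are assumptions chosen here to summarize the text, and no quantitative affinity data are implied.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PeptideRecord:
    sequence: str      # one-letter amino acid sequence
    source: str        # phage clone, or "consensus" for the designed peptide
    specificity: str   # class label summarizing the description in the text


# Entries transcribed from the description of Figure 8 in this section.
PEPTIDES = [
    PeptideRecord("GPHRRLPPTPATR", "T12.SRC3.3", "Src-family specific"),
    PeptideRecord("GILAPPVPPRNTR", "T12.SRC3.1", "generic"),
    PeptideRecord("VLKRPLPIPPVTR", "T12.SRC3.6", "Src preferential"),
    PeptideRecord("ANPSPATRPLPTR", "T12.SRC3.2", "Src-family specific"),
    PeptideRecord("RSTPRPLPMLPTTR", "R8C.YES3.5", "Src-family specific"),
    PeptideRecord("RSTPRPLPPLPTTR", "consensus", "Src-family specific"),
]


def by_specificity(label):
    """Return the sequences of all peptides recorded under `label`."""
    return [p.sequence for p in PEPTIDES if p.specificity == label]
```

Calling `by_specificity("generic")` returns only GILAPPVPPRNTR, the peptide described as binding a broad range of SH3 domains.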


5.4. Further Discussion of Binding Experiments

At the outset it is apparent that the binding
affinity of certain peptides to the SH3 domain of Src and
Src-related proteins is governed by more than just the
presence of the preferred core consensus sequences, RPLPPLP
(SEQ ID NO:9) or RPLPMLP (SEQ ID NO:95; i.e., RPLP(P/M)LP,
SEQ ID NO:96). Thus, while the synthetic peptides
RSTPRPLPMLPTTR (R8C.YES3.5; SEQ ID NO:62) and RSTPRPLPPLPTTR
(consensus; SEQ ID NO:67) exhibit a strong specific binding
affinity for Src SH3, the other synthetic peptides tested
also exhibited an avid binding affinity to SH3 domains
relative to the 7-mer, RPLPPLP (SEQ ID NO:9). These other
peptides, GILAPPVPPRNTR (SEQ ID NO:63), VLKRPLPIPPVTR (SEQ ID
NO:64), GPHRRLPPTPATR (SEQ ID NO:65), and ANPSPATRPLPTR (SEQ
ID NO:66), sport core sequences and flanking sequences that
do not closely adhere to the preferred core consensus
sequences. Thus, these results suggest that binding affinity
to SH3 domains is governed to a large extent by the nature of
the amino acid residues flanking the core 7-mer sequence.

The binding characteristics of Src SH3-selected peptides
were determined using synthetic biotinylated peptides
corresponding to the sequences displayed by Src SH3-selected
phage. These biotinylated peptides were assayed for direct
binding to immobilized Src SH3-GST. Each of the five
library-derived peptides tested was found to bind to Src
SH3-GST and Yes SH3-GST over background (Figure 8).
Furthermore, a strong correlation was observed between the
similarity of a given peptide to the preferred core consensus
sequence RPLP(P/M)LP and the peptide's affinity for Src
SH3-GST. The core sequence of the clone T12.SRC3.1
(GILAPPVPPRNTR; SEQ ID NO:63) appears to provide more generic
SH3 domain-binding characteristics.

Experiments comparing the relative binding of various
phage clones to SH3 domains taken from a variety of proteins
demonstrated the preference of these clones for Src and Src-
related SH3 domains over SH3 domains taken from other
proteins.

It was further found that while the 7-mer having the
consensus sequence RPLPPLP (SEQ ID NO:9) bound to Src SH3-GST
only weakly, peptides comprising the consensus sequence
flanked by residues encoded by one of the Src SH3-selected
clones (R8C.YES3.5), RSTP (SEQ ID NO:97) at the N-terminal
end and TTR at the C-terminal end, bound significantly better
than any of the peptides tested (Figure 7). Thus, as stated
previously, sequences that flank the RPLP(P/M)LP (SEQ ID
NO:96) core appear to be important contributors to SH3
binding. It is further surmised that a peptide having or
resembling the sequence RSTPAPPVPPRTTR (SEQ ID NO:98) should
exhibit strong but generic binding to a variety of SH3
domains.
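
The construction described in the preceding paragraph, in which a 7-mer core is placed between the flanking residues RSTP and TTR, amounts to a simple concatenation. The Python sketch below illustrates this under stated assumptions: the function name `add_flanks` is hypothetical, and treating APPVPPR as the core of clone T12.SRC3.1 is an inference drawn here from comparing GILAPPVPPRNTR (SEQ ID NO:63) with RSTPAPPVPPRTTR (SEQ ID NO:98), not a statement made in the text.

```python
def add_flanks(core, n_flank="RSTP", c_flank="TTR"):
    """Join an N-terminal flank, a core sequence, and a C-terminal flank.

    The defaults are the R8C.YES3.5-derived flanks named in the text:
    RSTP at the N-terminal end and TTR at the C-terminal end.
    """
    return n_flank + core + c_flank


# Cores taken from the text; the third is an inferred core (see lead-in).
consensus_peptide = add_flanks("RPLPPLP")
yes3_5_peptide = add_flanks("RPLPMLP")
generic_candidate = add_flanks("APPVPPR")
```

With the default flanks, the consensus core RPLPPLP gives the 14-mer RSTPRPLPPLPTTR (SEQ ID NO:67), and the inferred core APPVPPR gives RSTPAPPVPPRTTR, the sequence surmised to bind a variety of SH3 domains.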

Similarly, it is observed that most of the Src SH3-
binding motifs are located near the carboxy-terminus of the
random peptides, adjacent to sequences which are fixed in
every clone (Figure 5). The exceptional clones tend to
possess sequences that resemble motifs that include fixed
flanking sequences. This clustering contrasts with previous
results, in which binding motifs are distributed throughout
the random peptide. Kay, B.K., et al., in Gene (1993)
128:59-65.

The binding of the library-derived Src SH3-binding
peptides was compared to that of peptides corresponding to
proline-rich regions of natural proteins. Peptides
corresponding to SH3-binding regions in human PI-3' Kinase
(KISPPTPKPRPPRPLPV; SEQ ID NO:69) and human SOS1.20
(GTVEPVPPPVPPRRRPESA; SEQ ID NO:68), as well as a proline-
rich region of the cytoskeletal protein vinculin
(LAPPKPPLPEGEV; SEQ ID NO:70), bound Src SH3 less well than
the library-derived peptides (Figure 7).

As mentioned above, the relative specificity of binding
was explored. Thus, the relative binding of Src SH3-selected
peptides to equal amounts of GST fusions to SH3 domains from
different proteins was determined (Figure 8). While all of
the library-derived peptides bound the Src and Yes SH3
domains almost equally well, none of the peptides (with the
exception of peptide T12.SRC3.1, the most divergent peptide
tested) bound the SH3 domains of Grb2, Crk, Abl or PLCγ1
appreciably. Thus, the library-derived peptides, in contrast
with a peptide derived from SOS1, exhibit SH3 binding that is
relatively specific for Src-family members.

Next, it was determined whether the binding to the Src
SH3 domain was qualitatively like the interactions of the SH3
domain and natural proteins found in cell lysates. Thus,
radiolabeled proteins were prepared from NIH 3T3 cell lysates
and chromatographed over Src SH3-GST immobilized on
glutathione-linked Sepharose. SDS-PAGE shows that a number
of proteins can be affinity purified in this manner. The
synthesized peptides bind quite well to the Src SH3 domain,
as they can compete the binding of radiolabeled proteins from
cell lysates to immobilized Src-GST, with an IC50 of 1-10 mM
(Figure 9). In conclusion, the peptides can efficiently
block the interaction of cellular proteins with Src SH3 in
vitro.

Moreover, Xenopus laevis oocytes injected with mRNA
encoding constitutively active Src undergo progesterone-
induced maturation at an accelerated rate relative to oocytes
injected with water or c-Src mRNA. Unger, T.F. and Steele,
R.E. in Mol. Cell. Biol. (1992) 12:5485-5498. To explore the
ability of the library-derived Src SH3-binding peptides to
exert a biochemical effect in vivo, the influence of the
peptides on the maturation of Xenopus laevis oocytes was
examined. Hence, stage VI oocytes were injected with
peptide, exposed to progesterone, and scored for germinal
vesicle breakdown. Figure 10 shows that the rate of
maturation was accelerated by approximately one hour when
oocytes were injected with the SH3-binding peptide consisting
of RPLPPLP (SEQ ID NO:9) flanked by residues from clone
T12.SRC3.6 (VLKRPLPIPPVTR; SEQ ID NO:64), but not with water
or a peptide corresponding to a proline-rich segment of
vinculin (LAPPKPPLPEGEV; SEQ ID NO:70) as controls. The
magnitude of this effect is roughly equivalent to that seen
with injection of mRNA encoding constitutively active Src.
See, e.g., Figure 3B in Unger, T.F. and Steele, R.E., supra.
This result suggests that the library-derived Src SH3-binding
peptide is effectively relieving an inhibitory effect of the
Src SH3 domain upon Src PTK activity. This model is
consistent with a number of studies which have demonstrated
an inhibitory effect of the Src SH3 domain upon Src kinase
and transforming activity. See, e.g., Okada, M., et al.,
supra; Murphy, S.M., et al., supra; and Superti-Furga, G., et
al., supra.

5.5. Diagnostic And Therapeutic Agents
Based On SH3 Binding Peptides and Additional
Methods of Their Use

As already indicated above, the present invention
also seeks to provide diagnostic, prophylactic, and
therapeutic agents based on the SH3 binding peptides
described herein.

In one embodiment, diagnostic agents are provided,
preferably in the form of kits, comprising an SH3 domain-
binding peptide and a detectable label conjugated to said
peptide directly, indirectly or by complexation, said peptide
comprising: (i) a core sequence motif of the formula RXLPφφP
(SEQ ID NO:71), wherein X represents any amino acid except
cysteine and φ represents a hydrophobic amino acid residue,
including F, Y, W, V, A, I, L, P, M or G, each letter
representing the standard one-letter designation for the
corresponding amino acid residue; and (ii) two or more
additional amino acid residues flanking said core sequence at
its C-terminal end, N-terminal end or both.

The diagnostic agents of the present invention can be
used to detect the presence of SH3 domains of a generic or
specific type in cells, tissues or organs either in vitro or
in vivo. For in vivo applications, the diagnostic agent is
preferably mixed with a pharmaceutically acceptable carrier
for administration, either enterally, parenterally or by some
other route dictated by the needs of the particular
application.

In a particular embodiment, for example, an assay based
on a fusion product is contemplated which comprises a Src SH3
domain-binding peptide of the invention and a substrate for
deregulated or "activated" Src. For instance, a muscle
biopsy, taken from a subject suspected of being infected by
the Rous sarcoma virus, can be treated with an effective
amount of the fusion product. By subsequent analysis of the
degree of conversion of the substrate, one can potentially
detect infection by the Rous sarcoma virus in the subject,
particularly mammals, especially chickens. The presence of
the retrovirus, which causes the expression of deregulated or
"activated" Src, may thus be indicated by unusually high
levels of Src as revealed by large amounts of the converted
substrate. See, for example, Paxton, W.G. et al., in
Biochem. Biophys. Res. Commun. (1994) 200(1):260-267
(detection of phosphorylated tyrosine and serine residues of
angiotensin II AT1 receptor, a substrate of Src family
tyrosine kinases); another suitable substrate may be the
protein p68 (Fumagalli, S. et al., in Nature (1994)
368(6474):871-874; Taylor, S.J. and Shalloway, D., in ibid.,
at 867-871).

Alternatively, the enzyme can be isolated by selective
binding to a form of the SH3 domain-binding peptides of the
present invention (e.g., biotin-peptide conjugate). After
isolation of the protein-peptide conjugate complex (e.g., on
a column comprising streptavidin), the activity of the enzyme
can then be assayed by conventional methods to determine its
level of protein kinase activity which can be taken as an
indication of the presence of the deregulated or "activated"
form of the enzyme. An assay for Src kinase has been
described by Klinz and Maness, in Neuroprotocols (a companion
to Neuroscience) (1992) 1(3):224-231.

Moreover, the diagnostic agents of the invention can
also serve as imaging agents of cells, tissues or organs,
especially those that contain proteins with an SH3 domain.
For example, neural cells (e.g., neurons, other areas of the
brain), osteoclasts, osteoblasts, platelets, immune cells,
and other dividing cells are known to express or contain
proteins with SH3 domains. Thus, an image can be taken of
portions of the body to serve as a baseline for subsequent
images to detect physiologic or biochemical changes in the
subject's body. For instance, changes in the condition of
cellular levels of Src or a transformation of the cellular
Src to an "activated" form may be detected using the
diagnostic or imaging agents of the present invention.

Accordingly, it has been demonstrated that an SH3-
binding peptide tagged with a fluorescence emitter can
provide an image of the cytoskeleton. The images are
presented in Figure 11. As can be seen from Figure 11,
panels A, B, and C show the fluorescence image that is
obtained on treating NIH 3T3 fibroblasts with SH3 domain-
binding peptides modified to include a fluorescent tag. In
sharp contrast, panel D shows only a dark image that is
produced when the cells are treated with a proline-rich
segment of vinculin as a control.

In another embodiment, an SH3 domain-binding peptide-
horseradish immunoperoxidase complex or related
immunohistochemical agent could be used to detect and
quantitate specific receptor molecules in tissues, serum or
body fluids. In particular, the present invention provides
useful diagnostic reagents for use in immunoassays, Southern
or Northern hybridization, and in situ assays. Accordingly,
the diagnostic agents described herein may be suitable for
use in vitro or in vivo.

In addition, the diagnostic or imaging agent of the
present invention is not limited by the nature of the
detectable label. Hence, the diagnostic agent may contain
one or more such labels including, but not limited to,
radioisotopes, fluorescent tags, paramagnetic substances,
heavy metals, or other image-enhancing agents. Those of
ordinary skill in the art would be familiar with the range of
labels and methods to incorporate or conjugate them into the
SH3 domain-binding peptide to form diagnostic agents.

In yet a further embodiment, pharmaceutical compositions
are provided comprising an SH3 domain-binding peptide and a
pharmaceutically acceptable carrier. In a specific
embodiment of the invention, the pharmaceutical composition
is useful for the modulation of the activity of SH3 domain-
containing proteins. By "modulation" is meant either
inhibition or enhancement of the activity of the protein
target. Accordingly, a pharmaceutical composition is
disclosed comprising an SH3 domain-binding peptide and a
pharmaceutically acceptable carrier, said peptide comprising:
(i) a 9-mer sequence motif of the formula RXLPφφPXψ (SEQ ID
NO:10), wherein X represents any amino acid except cysteine,
φ represents a hydrophobic amino acid residue, and wherein ψ
is a hydrophilic amino acid residue except cysteine, each
letter representing the standard one-letter designation for
the corresponding amino acid residue; and, optionally, (ii)
additional amino acid residues flanking the 9-mer sequence at
its C-terminal end, N-terminal end or both, up to a total of
45 amino acid residues, including said 9-mer sequence.
Preferably, the peptide comprises at least one, more
preferably at least two, and most preferably at least three
additional amino acids flanking the 9-mer sequence.

As stated above, the therapeutic or diagnostic agents of 
the invention may also contain appropriate pharmaceutically 
acceptable carriers, diluents and adjuvants. Such 

pharmaceutical carriers can be sterile liquids, such as water
and oils including those of petroleum, animal, vegetable or 
synthetic origin, such as peanut oil, soybean oil, mineral 
oil, sesame oil and the like. Water is a preferred carrier 
when the pharmaceutical composition is administered 

intravenously. Saline solutions and aqueous dextrose and
glycerol solutions can also be employed as liquid carriers, 
particularly for injectable solutions. Suitable 
pharmaceutical excipients include starch, glucose, lactose, 
sucrose, gelatin, malt, rice, flour, chalk, silica gel, 

magnesium carbonate, magnesium stearate, sodium stearate,
glycerol monostearate, talc, sodium chloride, dried skim
milk, glycerol, propylene glycol, water, ethanol and the
like. These compositions can take the form of solutions, 
suspensions, tablets, pills, capsules, powders, sustained- 

release formulations and the like. Suitable pharmaceutical
carriers are described in "Remington's Pharmaceutical 
Sciences" by E.W. Martin. 

Such compositions will contain an effective therapeutic 
amount of the active compound together with a suitable amount 

of carrier so as to provide the form for proper

administration to the subject. While intravenous injection 
is a very effective form of administration, other modes can 
be employed, including but not limited to intramuscular, 
intraperitoneal, and subcutaneous injection, and oral, nasal, 

enteral, and parenteral administration.

The therapeutic agents and diagnostic agents of the
instant invention are used for the treatment and/or diagnosis
of animals, and more preferably, mammals including humans, as
well as dogs, cats, horses, cows, pigs, guinea pigs, mice and
rats. Accordingly, other methods contemplated in the present
invention include, but are not limited to, a method of
modulating, i.e., inhibiting or enhancing, bone resorption in
a mammal (see, e.g., Hall, T.J., in Biochem. Biophys. Res.
Commun. (1994) 199(3):1237-1244), a method of disrupting
protein tyrosine kinase-mediated signal transduction pathways
or a method of regulating the processing, trafficking or
translation of RNA in a cell by introducing or administering
an effective amount of an SH3 domain-binding peptide of the
present invention (see, e.g., Taylor, S.J. and Shalloway, D.,
supra).

The diagnostic or therapeutic agents of the present
invention can be modified by attachment to soluble
macromolecules such as proteins, polysaccharides, or
synthetic polymers. For example, the peptide could be
coupled to styrene-maleic acid copolymers (see, e.g.,
Matsumura and Maeda, Cancer Res. (1986) 46:6387),
methacrylamide copolymers (Kopecek and Duncan, J. Controlled
Release (1987) 6:315), or polyethylene glycol (PEG) (e.g.,
Hershfield and Buckley, N. Engl. J. Med. (1987) 316:589; Ho
et al., Drug Metab. Dispos. (1986) 14:349; Chua et al., Ann.
Intern. Med. (1988) 109:114). The agents, if desired,
are further targeted by attachment to an antibody, especially
a monoclonal antibody. Such antibodies include but are not
limited to chimeric, single chain, Fab fragments, and Fab
expression libraries. In one embodiment the agent is coupled
to the macromolecule via a degradable linkage so that it will
be released in vivo in its active form.

In another embodiment, the therapeutic or diagnostic
agent may be delivered in a vesicle, in particular a
liposome. See, Langer, Science (1990) 249:1527-1533; Treat
et al., in Liposomes in the Therapy of Infectious Disease
and Cancer, Lopez-Berestein and Fidler (eds.), Liss, New York
(1989) pp. 353-365; Lopez-Berestein, ibid., pp. 317-327.

In yet another embodiment, the therapeutic or in vivo
diagnostic agent can be delivered in a controlled release
system. In one embodiment, a pump may be used (see Langer,
supra; Sefton, CRC Crit. Ref. Biomed. Eng. (1987) 14:201;
Buchwald et al., Surgery (1980) 88:507; Saudek et al., N.
Engl. J. Med. (1989) 321:574). In another embodiment,
polymeric materials may be used (see Medical Applications of
Controlled Release, Langer and Wise (eds.), CRC Pres., Boca
Raton, Florida, 1974; Controlled Drug Bioavailability, Drug
Product Design and Performance, Smolen and Ball (eds.), Wiley,
New York 1984; Ranger and Peppas, J. Macromol. Sci. Rev.
Macromol. Chem. (1983) 23:61; see, also, Levy et al., Science
(1985) 228:190; During et al., Ann. Neurol. (1989) 25:351;
Howard et al., J. Neurosurg. (1989) 71:105). In a preferred
embodiment, a controlled release system may be placed next to
the therapeutic target, thus requiring only a fraction of the
systemic dose (see, e.g., Goodson, in Medical Applications of
Controlled Release, supra, (1984) 2:115-138). It will be
recognized by one of ordinary skill in the art that a
particular advantage of the invention is that a peptide will
not be subject to the problems of denaturation and
aggregation associated with proteins held in the warm, moist
environment of a body in a controlled release system.

Other controlled release systems are discussed in the 

review by Langer, in Science (1990) 249:1527-1533.

6. EXAMPLES

6.1. Preparation of the TSAR-9 Library

6.1.1. Synthesis and Assembly of
Oligonucleotides

Figure 1 shows the formula of the
oligonucleotides and the assembly scheme used in construction
of the TSAR-9 library. The oligonucleotides were synthesized
with an Applied Biosystems 380a synthesizer (Foster City,
CA), and the full-length oligonucleotides were purified by
HPLC.

Five micrograms of each of the pair of oligonucleotides
were mixed together in buffer (10 mM Tris-HCl, pH 8.3, 15 mM
KCl, 0.001% gelatin, 1.5 mM magnesium chloride), with 0.1%
Triton X-100, 2 mM dNTP's, and 20 units of Taq DNA
polymerase. The assembly reaction mixtures were incubated at
72 °C for 30 seconds and then 30 °C for 30 seconds; this
cycle was repeated 60 times. It should be noted that the
assembly reaction is not PCR, since a denaturation step was
not used. Fill-in reactions were carried out in a thermal
cycling device (Ericomp, La Jolla, CA) with the following
protocol: 30 seconds at 72 °C, 30 seconds at 30 °C, repeated
for 60 cycles. The lower temperature allows for annealing of
the six base complementary region between the two sets of the
oligonucleotide pairs. The reaction products were
phenol/chloroform extracted and ethanol precipitated.
Greater than 90% of the nucleotides were found to have been
converted to double stranded synthetic oligonucleotides.
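
For readers who wish to record the fill-in protocol in machine-readable form, the two-step cycle described above can be written down as data. The Python sketch below is illustrative only; the `Step` type and `total_seconds` helper are assumptions introduced here, and the computed duration counts hold times alone, ignoring the temperature ramp times of any particular thermal cycler.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temperature_c: float  # hold temperature in degrees Celsius
    seconds: int          # hold time in seconds


# One fill-in cycle as described in the text: extension at 72 °C,
# then annealing of the six-base complementary region at 30 °C.
FILL_IN_CYCLE = (Step(72, 30), Step(30, 30))
CYCLES = 60  # the cycle is repeated 60 times; there is no denaturation step


def total_seconds(cycle, repeats):
    """Total hold time for `repeats` repetitions of `cycle`, excluding ramps."""
    return repeats * sum(step.seconds for step in cycle)


hold_time_minutes = total_seconds(FILL_IN_CYCLE, CYCLES) / 60
```

On these figures the hold time alone amounts to 3600 seconds, that is, one hour per assembly reaction.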

After resuspension in 300 µL of buffer containing 10 mM
Tris-HCl, pH 7.5, 1 mM EDTA (TE buffer), the ends of the
oligonucleotide fragments were cleaved with Xba I and Xho I
(New England BioLabs, Beverly, MA) according to the
supplier's recommendations. The fragments were purified by
4% agarose gel electrophoresis. The band of correct size was
removed and electroeluted, concentrated by ethanol
precipitation and resuspended in 100 mL TE buffer.
Approximately 5% of the assembled oligonucleotides can be
expected to have internal Xho I or Xba I sites; however, only
the full-length molecules were used in the ligation step of
the assembly scheme. The concentration of the synthetic
oligonucleotide fragments was estimated by comparing the
intensity on an ethidium bromide stained gel run along with
appropriate quantitated markers. All DNA manipulations not
described in detail were performed according to Maniatis,
supra.

To demonstrate that the assembled enzyme digested
oligonucleotides could be ligated, the synthesized DNA
fragments were examined for their ability to self-ligate.
The digested fragments were incubated overnight at 18 °C in
ligation buffer with T4 DNA ligase. When the ligation
products were examined by agarose gel electrophoresis, a
concatamer of bands was visible upon ethidium bromide
staining. As many as five different unit length concatamer
bands (i.e., dimer, trimer, tetramer, pentamer, hexamer) were
evident, suggesting that the synthesized DNA fragments were
efficient substrates for ligation.

6.1.2. Construction of Vectors

The construction of the M13 derived phage
vectors useful for expressing a TSAR library has been
recently described (Fowlkes, D. et al., BioTech. (1992)
13:422-427). To express the TSAR-9 library, an M13 derived
vector, m663, was constructed as described in Fowlkes. The
m663 vector contains the pIII gene having a c-myc-epitope,
i.e., as a stuffer fragment, introduced at the mature N-
terminal end, flanked by Xho I and Xba I restriction sites
(see also, Figure I of Fowlkes).


6.1.3. Expression of the TSAR-9 Library

The synthesized oligonucleotides were then
ligated to Xho I and Xba I double-digested m663 RF DNA
containing the pIII gene (Fowlkes) by incubation with ligase
overnight at 12 °C. More particularly, 50 ng of vector DNA
and 5 ng of the digested synthesized DNA were mixed
together in 50 µL ligation buffer (50 mM Tris, pH 8.0, 10 mM
MgCl2, 20 mM DTT, 0.1 mM ATP) with T4 DNA ligase. After
overnight ligation at 12 °C, the DNA was concentrated by
ethanol precipitation and washed with 70% ethanol. The
ligated DNA was then introduced into E. coli (DH5αF'; GIBCO
BRL, Gaithersburg, MD) by electroporation.

A small aliquot of the electroporated cells was plated 
and the number of plaques counted to determine that 10* 
recombinants were generated. The library of E. coli cells 
containing recombinant vectors was plated at a high density 

(~400,000 per 150 mm petri plate) for a single amplification
of the recombinant phage. After 8 hr, the recombinant 
bacteriophage were recovered by washing each plate for 18 hr 
with SMG buffer (100 mM NaCl, 10 mM Tris-HCl, pH 7.5, 10 mM 
MgCl2, 0.05% gelatin) and after the addition of glycerol to
50% were frozen at -80 °C. The TSAR-9 library thus formed
had a working titer of -2 x lo" pfu/ml. 

6.2. Preparation of TSAR-12 Library

Figure 2 shows the formula for the synthetic
oligonucleotides and the assembly scheme used in the
construction of the TSAR-12 library. As shown in Figure 2,
the TSAR-12 library was prepared substantially the same as
the TSAR-9 library described in Section 6.1 above with the
following exceptions: (1) each of the variant non-predicted
oligonucleotide sequences, i.e., NNB, was 30 nucleotides in
length, rather than 54 nucleotides; (2) the restriction sites
included at the 5' termini of the variant, non-predicted
sequences were Sal I and Spe I, rather than Xho I and Xba I;
and (3) the invariant sequence at the 3' termini to aid
annealing of the two strands was GCGGTG and CGCCAC rather
than CCAGGT and GGTCCA (5' to 3').
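
The NNB coding scheme referred to above can be examined by enumerating its codons. The Python sketch below is offered for illustration only and makes two assumptions that are not stated in the text: that N and B carry their standard IUPAC meanings (N is any base; B is C, G or T, i.e., not A), and that the standard genetic code applies. The variable names are introduced here.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (first base varies slowest, third base fastest); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

N = "ACGT"  # IUPAC N: any base
B = "CGT"   # IUPAC B: any base except A

# Every codon permitted by the NNB scheme.
nnb_codons = ["".join(codon) for codon in product(N, N, B)]
stop_codons = [c for c in nnb_codons if CODON_TABLE[c] == "*"]
encoded_residues = {CODON_TABLE[c] for c in nnb_codons} - {"*"}
```

Under these assumptions the scheme permits 48 codons that still encode all twenty amino acids, while excluding two of the three stop codons (TAA and TGA); only TAG remains possible.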

After synthesis including numerous rounds of annealing
and chain extension in the presence of dNTP's and Taq DNA
polymerase, and purification as described above in Section
6.1.1, the synthetic double stranded oligonucleotide
fragments were digested with Sal I and Spe I restriction
enzymes and ligated with T4 DNA ligase to the nucleotide
sequence encoding the M13 pIII gene contained in the m663
vector to yield a library of TSAR-expression vectors as
described in Sections 6.1.2 and 6.1.3. The ligated DNA was
then introduced into E. coli (DH5αF'; GIBCO BRL,
Gaithersburg, MD) by electroporation. The library of E. coli
cells was plated at high density (~400,000 per 150 mm petri
plate) for amplification of the recombinant phage. After
about 8 hr, the recombinant bacteriophage were recovered by
washing for 18 hr with SMG buffer and after the addition of
glycerol to 50% were frozen at -80 °C.

The TSAR- 12 library thus formed had a working titer of 
-2 X 10** pfu/mL. 

6.3. Characterization of the TSAR-9 and -12 Libraries 

The inserted synthetic oligonucleotides for each of
the TSAR libraries, described in Sections 6.1 and 6.2 above,
had a potential coding complexity of 20^ (~10*"') and 20^,
respectively, and since ~10'* molecules were used in each
transformation experiment, each member of these TSAR
libraries should be unique. After plate amplification the
library solution or stock has 10* copies of each member/mL.

It was observed that very few (<10%) of the inserted
oligonucleotide sequences characterized so far in both of the
libraries have exhibited deletions or insertions. This is
likely a reflection of the accuracy of assembling the
oligonucleotides under the conditions used and the fact that
certain types of mutations (i.e., frame-shifts) would not be
tolerated, as pIII is an essential protein for phage
propagation.

In order to determine whether any coding bias existed in
the variant non-predicted peptides expressed by these
libraries, perhaps due to biases imposed in vitro during
synthesis of the oligonucleotides or in vivo during
expression by the reproducing phage, inserts were sequenced
as set forth below.

6.3.1. Characterization of TSAR-9 Library

Inserted synthetic oligonucleotide fragments
of 23 randomly chosen isolates were examined from the TSAR-9
library. Individual plaques were used to inoculate 1 ml of
2xYT broth containing E. coli (DH5αF') cells and the cultures
were allowed to grow overnight at 37 °C with aeration. DNA
was isolated from the culture supernatants according to
Maniatis, supra. Twenty-three individual isolates were
sequenced according to the method of Sanger (Proc. Natl.
Acad. Sci. USA (1977) 74:5463-5467) using as a primer the
oligonucleotide 5'-AGCGTAACGATCTCCCG (SEQ ID NO. 99), which
is 89 nucleotides downstream of the pIII gene cloning site of
the m663 vector used to express the TSARs.

Nucleotide sequences and their encoded amino acid
sequences were analyzed with the MacVector computer program
(IBI, New Haven, CT). The Microsoft EXCEL program was used
to evaluate amino acid frequencies. Such analyses showed
that most nucleotide codons, and hence most amino
acids, occurred at the expected frequency in the TSAR-9
library of expressed proteins. The notable exceptions were
glutamine and tryptophan, which were over- and
under-represented, respectively.

It is of interest to note the paucity of TAG stop codons
in the inserts, i.e., only 2 of ~200 isolates characterized
contained a TAG stop codon. About half [1-(47/48)^] of the
phage inserts were expected to have at least one TAG codon in
view of the assembly scheme used. However, most of the TAG-
bearing phage appear to have been lost from the library, even
though the bacterial host was supE. This may be a
consequence of suppression being less than 100% effective.
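
The "about half" expectation can be checked with a short calculation. The sketch below is illustrative only and rests on two assumptions that are not stated in the text above: that B in an NNB codon means C, G, or T (the usual IUPAC reading), and that a TSAR-9 insert carries 36 random codons (inferred from the two 54-nucleotide NNB segments mentioned in Section 6.2; the exponent in the printed formula is illegible). The names `p_at_least_one_tag` and `ASSUMED_TSAR9_CODONS` are hypothetical.

```python
from itertools import product

# NNB codons: N is any base; B is assumed to be C, G or T (IUPAC "not A").
N_BASES = "ACGT"
B_BASES = "CGT"
nnb_codons = ["".join(c) for c in product(N_BASES, N_BASES, B_BASES)]

# Of the three stop codons, only TAG fits the NNB pattern
# (TAA and TGA end in A, which B excludes).
stops_in_nnb = [c for c in nnb_codons if c in {"TAA", "TAG", "TGA"}]


def p_at_least_one_tag(n_codons: int) -> float:
    """Probability that n_codons independent NNB codons include >= 1 TAG."""
    p_not_tag = 1 - len(stops_in_nnb) / len(nnb_codons)  # 47/48
    return 1 - p_not_tag ** n_codons


# Assumption: 36 random codons per TSAR-9 insert (2 segments x 54 nt / 3).
ASSUMED_TSAR9_CODONS = 36

print(len(nnb_codons), stops_in_nnb)
print(round(p_at_least_one_tag(ASSUMED_TSAR9_CODONS), 3))
```

Under these assumptions the probability comes out slightly above one half, which is consistent with the "about half" stated above and far more than the 2 of ~200 isolates actually observed.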

The amino acids encoded by the inserted double-stranded
synthesized oligonucleotide sequences, excluding the fixed
PG-encoding centers, were concatenated into a single sequence
and the usage frequency determined for each amino acid using
the Microsoft EXCEL program. These frequencies were compared
to those expected from the assembly scheme of the
oligonucleotides, and the divergence from expected values
represented by the size of the bars above and below the
baseline. Chi square analysis was used to determine the
significance of the deviations. The majority of amino acids
were found to occur at the expected frequency, with the
notable exceptions that glutamine and tryptophan were
somewhat over- and under-represented, respectively. Thus,
except for the invariant Pro-Gly, any position could have any
amino acid; hence, the sequences are unpredicted or random.
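
The comparison described above can be sketched in a few lines. This is not the applicants' EXCEL worksheet; it is a minimal illustration that assumes NNB codons with B = C, G, or T, uses the standard genetic code, and excludes the TAG stop codon from the expected distribution. The function names and the toy input string are hypothetical, and the toy string is not data from the libraries.

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def expected_frequencies(third_position: str = "CGT") -> dict:
    """Expected amino-acid frequencies for NN[third_position] codons.

    Stop codons are dropped, so the frequencies describe sense codons only.
    """
    counts = Counter(
        CODON_TABLE[a + b + c]
        for a, b, c in product(BASES, BASES, third_position)
    )
    counts.pop("*", None)
    total = sum(counts.values())
    return {aa: n / total for aa, n in counts.items()}


def chi_square(observed: Counter, expected_freq: dict) -> float:
    """Pearson chi-square statistic of observed counts against expectation."""
    n = sum(observed[aa] for aa in expected_freq)
    return sum(
        (observed[aa] - n * f) ** 2 / (n * f)
        for aa, f in expected_freq.items()
    )


expected = expected_frequencies()
# Hypothetical concatenated insert sequence, for illustration only.
toy_observed = Counter("QQQLSRAGVTPW")
print(expected["Q"], expected["W"])
print(chi_square(toy_observed, expected))
```

Under this scheme glutamine and tryptophan are each encoded by a single NNB codon (CAG and TGG), so each has an expected frequency of 1/47 among sense codons; the reported deviations for these two residues would be measured against that baseline.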

6.3.2. Characterization of TSAR-12 Library

Approximately 10 randomly chosen inserted
oligonucleotides from the TSAR-12 library were examined by
DNA sequencing as described above in Section 6.3.1. The
isolates were chosen at random from the TSAR-12 library and
prepared for sequencing, as were the TSAR-9 isolates.
Analysis showed that except for the invariant Gly any
position could have any amino acid; hence, the sequences are
unpredicted or random.

6.4. Preparation of R8C Library 

Referring now to Figure 3, two oligonucleotides
were synthesized on an Applied Biosystems Model 380a machine
with the sequence 5'-
TGACGTCTCGAGTTGTNNKNNKNNKNNKNNKNNKNNKNNKTGTGGATCTAGAAGGATC-3'
(SEQ ID NO:31) and 5'-GATCCTTCTAGATCC-3' (SEQ ID NO:32),
where N is an equimolar ratio of deoxynucleotides A, C, G,
and T, and K is an equimolar ratio of G and T. Fifty pmol of
each oligonucleotide was incubated at 42 °C for 5 min, then
37 °C for 15 min, in 50 µL of Sequenase™ buffer (U.S.
Biochemicals, Cleveland, OH) with 0.1 µg/µL acetylated BSA,
and 10 mM DTT. After annealing, 10 units of Sequenase™ (U.S.
Biochemicals) and 0.2 mM of each dNTP were added and
incubated at 37 °C for 15 min. The sample was then heated at
65 °C for 2 hr, digested with 100 units of both Xho I and Xba
I (New England BioLabs, Beverly, MA), phenol extracted,
ethanol precipitated, and resolved on a 15% non-denaturing
polyacrylamide gel. The assembled, digested fragment was gel
purified prior to ligation. The vector, m663 (Fowlkes, D. et
al. BioTech. (1992) 13:422-427), was prepared by digestion
with Xho I and Xba I, calf alkaline phosphatase (Boehringer
Mannheim, Indianapolis, IN) treatment, phenol extraction, and
purification by agarose gel electrophoresis. To ligate, 20 ng
vector was combined with 0.2 µg insert in 3 mL with T4 DNA
ligase (Boehringer Mannheim), according to the manufacturer.
After removal of the protein and buffer by phenol extraction
and ethanol precipitation, the ligated DNA was electroporated
into XL1-Blue E. coli (Stratagene, San Diego, CA) and plated
for eight hours at 37 °C. To recover the recombinant phage,
the top agar was collected with a spatula, mixed with an
equal volume of 100 mM NaCl, 10 mM MgCl2, and 50 mM Tris-HCl
(pH 7.5), and disrupted by two passes through an 18-gauge
syringe needle. The bacterial cells were removed by
centrifugation, and phage particles were collected by
polyethylene glycol precipitation and stored at -70 °C in 25%
glycerol. The library had 10* total recombinants and a
working titer of 6 x 10" pfu/mL.
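
The NNK scheme defined above (N = A, C, G, T; K = G, T) can be enumerated directly to see what it encodes. The sketch below is illustrative and uses the standard genetic code; the variable names are hypothetical, and the oligonucleotide string is copied from SEQ ID NO:31 as printed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# All codons matching NNK: any base, any base, then G or T.
nnk_codons = ["".join(c) for c in product("ACGT", "ACGT", "GT")]

stops = sorted(c for c in nnk_codons if CODON_TABLE[c] == "*")
amino_acids = sorted({CODON_TABLE[c] for c in nnk_codons} - {"*"})

# Random region of the R8C oligonucleotide (SEQ ID NO:31 as printed).
R8C_OLIGO = "TGACGTCTCGAGTTGTNNKNNKNNKNNKNNKNNKNNKNNKTGTGGATCTAGAAGGATC"
n_random_codons = R8C_OLIGO.count("NNK")

print(len(nnk_codons), "NNK codons; stop codons:", stops)
print(len(amino_acids), "amino acids encoded")
print(n_random_codons, "random codons;",
      len(amino_acids) ** n_random_codons, "possible peptide sequences")
```

Restricting the third position to G or T keeps all twenty amino acids available while leaving TAG as the only possible stop codon, which is why the random region is flanked cleanly by the two fixed TGT (cysteine) codons.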

Members of the library were checked for inserts by the
polymerase chain reaction (Saiki, et al. Science (1988)
239:487-491). Individual plaques on a petri plate were
touched with a sterile toothpick and the tip was stirred into
2xYT with F' E. coli bacteria and incubated overnight at 37 °C
with aeration. Five microliters of the phage supernatant
were then transferred to new tubes containing buffer (67 mM
Tris-HCl, pH 8.8/10 mM β-mercaptoethanol/16.6 mM ammonium
sulfate/6.7 mM EDTA/50 ng bovine serum albumin per mL), 0.1
mM deoxynucleotide triphosphates, and 1.25 units of Taq DNA
polymerase (Boehringer Mannheim, Indianapolis, IN) with 100
pmoles of oligonucleotide primers. The primers flanked the
cloning site in gene III of m663 (5'-TTCACCTCGAAAGCAAGCTG-3'
(SEQ ID NO: 100) and
5'-CCTCATAGTTAGCGTAACG-3' (SEQ ID NO: 101)). The assembly
reactions were incubated at 94 °C for 1 min, 56 °C for 2 min,
and 72 °C for 3 min; this cycle was repeated 24 times. The
reaction products were then resolved by electrophoresis on a
NuSieve 2.0% agarose gel (FMC, Rockland, ME). Gels revealed
that for 20 plaques tested, all were recombinant and had
single inserts of the expected size.

Based on the sample size of the library, it was
anticipated that 100% of the recombinants had single inserts.
However, all of the SH3-binding phage isolated from the R8C
library had double inserts. Such phage are presumed rare
(i.e., <5%) within the library, yet because the SH3-binding
peptide appears to need to be linear they were selected for
by our screening methods. Most likely they were formed
during the generation of the library; one scenario is that
the inserts ligated together to form head-to-head dimers and
that they were subsequently cloned into m663 DNA by ligation
with the vector's Xho I sticky end and by illegitimate
ligation with the vector's Xba I site (see Figure 4).

6.5. Preparation Of Target-Coated Microtiter Wells 

6.5.1. Preparation Of GST-SH3 Fusion Proteins 

The preparation of Src-GST fusion protein
was first described by Smith and Johnson, in Gene (1988)
67:31, the disclosure of which is incorporated by reference
herein. Briefly, pGEX-derived (Pharmacia, Piscataway, NJ)
constructs expressing GST fusion proteins containing the SH3
domains of Src, Grb2, Crk, Abl, or PLCγ were obtained from
Dr. Channing Der (University of North Carolina at Chapel
Hill); a construct expressing the SH3 domain of Yes was
obtained from Dr. Marius Sudol (Rockefeller University). The
use of the pGEX bacterial expression vector for the
production of GST-SH3 fusion proteins is well-known to those
in the art. See, e.g., Cicchetti, P. et al., in Science
(1992) 257:803-806. Briefly, the coding region for a
particular SH3 domain can be fused in-frame at the Bam HI
site of pGEX-2T. Thus, fusion proteins were prepared as per
the manufacturer's instructions, and quantified by Coomassie
Blue staining of SDS-polyacrylamide gels. Microtiter wells
were coated with 5-20 µg GST-SH3 fusion protein in 100 mM
NaHCO3, pH 8.5, blocked with 100 mM NaHCO3 (pH 8.5) 1% BSA,
and washed. All washes consisted of five applications of
1X PBS, 0.1% Tween 20, 0.1% BSA (Buffer A). Where
appropriate, the amount of protein bound to each well was
quantified with an anti-GST antibody-based ELISA (Pharmacia,
Piscataway, NJ), and with a GST-binding phage, isolated
during the course of this work.

6.5.2. Coating of Microtiter Wells

Bacterially expressed Src SH3 glutathione-S-
transferase (Src-GST) fusion protein was purified from
bacterial lysates using glutathione agarose 4B (Pharmacia),
according to the manufacturer's instructions. Bound Src-GST
fusion protein was eluted from the glutathione agarose with
10 mM glutathione in PBS. Microtiter wells were then coated
with Src-GST fusion protein (1-10 µg/well, in 50 mM NaHCO3, pH
8.5) overnight at 4 °C. To block non-specific binding of
phage, 100 µL of 1% BSA in 100 mM NaHCO3, pH 8.5, was added to
each well and allowed to incubate at room temperature for 1
hour. The wells were then washed five times with 200 µL PBS,
0.1% Tween 20, 0.1% BSA (Buffer A).

6.6. Biopanning And Subsequent Characterization Of
Phage-Displayed Random Peptide Libraries
With Src-GST Fusion Protein As Target Molecule

6.6.1. Isolation of Src SH3-Binding Phage

Library screens were performed as
previously described. Kay, B.K., et al., in Gene (1993)
128:59-65. Briefly, 1 x 10" pfu TSAR-9, TSAR-12, or R8C
phage in Buffer A were incubated in a Src SH3-GST-coated well
for 2 hours. The wells were washed, and bound phage were
eluted with 100 µL 50 mM glycine-HCl (pH 2.2), transferred to
a new well, and neutralized with 100 µL 200 mM NaHPO4 (pH
7.0). Recovered phage were used to infect 1 x 10* DH5αF'
E. coli cells in 20 mL 2xYT; the infected cells were grown
overnight, resulting in a 1000- to 10,000-fold amplification
of phage titer. Amplified phage were panned twice more, as
above, excepting the amplification step. Binding phage
recovered after the third round of panning were plated at a
low density on a lawn of DH5αF' E. coli cells to yield
isolated plaques for clonal analysis. Isolated plaques were
used to produce small cultures from which phage stocks and
DNA were recovered for phage binding experiments and dideoxy
sequencing (Sanger, F., et al., in Proc. Natl. Acad. Sci. USA
(1977) 74:5463-5467), respectively. Clones were confirmed as
binding the SH3 domain by applying equal titers of phage to
wells containing Src SH3-GST or GST alone, and titering the
number of eluted particles from each well, or detecting bound
phage with an anti-phage antibody-based ELISA (Pharmacia).

Indeed, the ability of isolated phage clones to bind to
several SH3 domains derived from a variety of different
proteins can be investigated by the manner described above.
GST-SH3 fusion proteins containing SH3 domains from a variety
of different proteins are bound to microtiter wells. An
aliquot of the aforementioned phage stocks (50 µL) is
introduced into wells containing the different GST-SH3 fusion
proteins. After room temperature incubation for 1-2 hours,
the liquid contents of the microtiter plates are removed, and
the wells are washed 5 times with 200 µL Buffer A. Bound
phage are eluted with 100 µL 50 mM glycine (pH 2.2),
transferred to a new well, and neutralized with 100 µL 200 mM
NaHPO4 (pH 7.0). The phage are diluted 10'^- to 10^-fold, and
aliquots are plated onto lawns of DH5αF' E. coli cells to
establish the number of plaque forming units in the output
sample. From these experiments, the relative specificity of
different Src SH3 binding clones for SH3 domains derived from
other proteins is determined.
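
The titering and specificity comparison described above reduce to simple arithmetic, sketched below. The function names and all numeric inputs are hypothetical illustrations, not values from these experiments; the sketch assumes that plaque counts are taken from a single plated dilution and that specificity is expressed as the ratio of phage recovered from a target well to phage recovered from a control well.

```python
import math


def pfu_per_ml(plaques: int, dilution_factor: float, volume_plated_ml: float) -> float:
    """Titer of the undiluted sample from a plaque count on one plate."""
    return plaques * dilution_factor / volume_plated_ml


def recovery_ratio(target_pfu: float, control_pfu: float) -> float:
    """Fold enrichment of eluted phage from a target well over a control well."""
    return target_pfu / control_pfu


# Hypothetical example: 150 plaques from 0.1 mL of a 10^4-fold dilution
# of the eluate from a Src SH3-GST well ...
src_titer = pfu_per_ml(plaques=150, dilution_factor=1e4, volume_plated_ml=0.1)
# ... and 30 plaques from 0.1 mL of a 10^2-fold dilution from a GST-only well.
gst_titer = pfu_per_ml(plaques=30, dilution_factor=1e2, volume_plated_ml=0.1)

print(f"Src SH3-GST well: {src_titer:.2e} pfu/mL")
print(f"GST-only well:    {gst_titer:.2e} pfu/mL")
print(f"Enrichment:       {recovery_ratio(src_titer, gst_titer):.0f}-fold")
```

The same ratio can be computed between wells coated with different GST-SH3 fusion proteins to compare how selectively a given clone binds one SH3 domain over another.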

6.6.2. Phage ELISA and Nucleotide Sequencing

To evaluate the binding of isolates to
various target proteins, enzyme-linked immunoassays (ELISA)
were also performed. Bacterial cultures were infected with
phage isolates and cultured overnight in 2xYT at 37 °C. The
cells were spun down and 25 µL of supernatant was added to
microtiter plate wells coated with 50 µL of protein (1 mg/mL
in 100 mM NaHCO3, pH 8.4; overnight at 4 °C or for a few hours
at room temperature) and blocked (1 mg/mL BSA in 100 mM
NaHCO3, pH 8.4; for about one hour). The phage are incubated
in the well with 25 µL of PBS-0.1% Tween 20 at RT for 2 hr.
The wells are then washed multiple times over 30 minutes. To
each well is added 50 µL of polyclonal anti-phage antibody
conjugated to horseradish peroxidase. The antibody is
diluted 1:3000 in BSA-PBS-Tween 20; it was obtained from
Pharmacia (Piscataway, NJ; catalog number 27-9402-01). After
30 minutes, the wells are washed again with BSA-PBS-Tween 20
for 20 minutes. Finally, 100 µL of ABTS reagent (Pharmacia,
with H2O2) are added to each well for the development of
color. Plates are read with a plate reader (Molecular
Devices, Menlo Park, CA) at 405 nm wavelength.

The relevant segments of the Src SH3 binding clones (or
phage clones that bind to SH3 domains of other proteins)
were sequenced using standard
methods. Sanger, F., et al., in Proc. Natl. Acad. Sci. USA
(1977) 74:5463-5467. The oligo primer 5'-AGCGTAACGATCTAAA-3'
(SEQ ID NO: 102) was used, which is 89 nucleotides downstream
of the gene III cloning site of M13 m666. The nucleotide
sequences were analyzed with the MacVector computer program
(IBI, New Haven, CT, USA). From this nucleotide sequence
information the primary sequence of each Src SH3 binding
peptide was deduced. The corresponding synthetic peptides
were then prepared by techniques well known in the art with
or without flanking sequences. Indeed, these synthetic
peptides have been shown to bind to SH3 domain targets, with
those possessing the phage flanking amino acid residues
exhibiting greater binding affinity.

6.7 In Vitro Peptide Binding Assays

Peptides were obtained from Research Genetics
(Birmingham, AL), Chiron Mimotopes (Victoria, Australia), or
synthesized by conventional techniques by Dr. J. Mark Carter
of Cytogen Corporation (Princeton, NJ). Peptide purity was
assessed by HPLC and/or mass spectrometry. Biotinylated
peptides were synthesized with either a KSGSG (SEQ ID NO: 103)
or a GSGS (SEQ ID NO: 104) peptide linker (a spacer) between
the biotin and the N-terminus of the peptide. Binding
experiments were performed as above, excepting the use of 10
peptide instead of phage. Bound biotinylated peptide was
detected with streptavidin conjugated to alkaline phosphatase
(Sigma Chemical Co., St. Louis, MO). After a one hour
incubation period at room temperature, the wells were washed,
and a solution of 3 mM p-nitrophenyl-phosphate (US
Biochemicals, Cleveland, OH) in 50 mM NaCO3 (pH 9.8), and 50
mM MgCl2 was added and color allowed to develop. Signals were
read with an ELISA plate reader (Molecular Devices, Menlo
Park, CA) at 405 nm wavelength. Binding experiments were
performed in triplicate. The results are presented in
Figures 7 and 8.

6.8. Peptide Competition of GST-SH3 Affinity
Precipitations of Cell Lysates

Labeled proteins are prepared by incubating a
culture of HeLa cells overnight with >100 nCi/mL 35S-
methionine. The cells are then washed and lysed with mild
detergent. This mixture of radioactive proteins is incubated
with Src-GST fusion protein that has been immobilized on
glutathione-linked Sepharose beads (Pharmacia, Piscataway,
NJ). After several hours of tumbling, the beads are pelleted
gently by low-speed centrifugation, and the supernatant is
discarded. The beads are then resuspended into a slurry in
PBS-0.1% Tween 20, pelleted, and washed several additional
times. Finally, a 2% SDS solution is added to the sample,
which is then boiled at 100 °C for 3 minutes. Afterward, the
sample is centrifuged, and the supernatant loaded on a 10%
polyacrylamide SDS gel for electrophoresis. After the
proteins have been resolved, the gel is fixed, dried down,
and exposed to X-ray film for autoradiography or phosphor
plates for scanning by a Molecular Dynamics PhosphorImager.

The ability of Src SH3 to bind certain 35S-labeled
proteins is examined for competability with exogenous
peptides. Synthetic peptides corresponding to phage-
displayed inserts and motifs are added at the time that the
lysate is incubated with the Src-GST fusion protein
immobilized on glutathione-linked Sepharose beads. The SH3
binding peptides block binding of all or some of the labeled
proteins while negative control peptides (unrelated peptide
sequences) do not. The amount of competition is quantified
and correlated with the amount of added SH3-domain binding
peptides.

Alternatively, NIH 3T3 cells were grown in Dulbecco's
Modified Eagle Medium (DME) + 10% fetal calf serum (FCS) + 80
nCi/mL Tran35S-label (ICN), washed with PBS, lysed in RIPA
buffer, and pelleted. Supernatant from 1.5 x 10* cells was
precleared with 100 µg glutathione-agarose-immobilized GST.
The supernatant was then incubated with 10 glutathione-
agarose-immobilized GST-SH3 fusion protein with or without
added test peptide in a final volume of 250 µL. Pelleted
beads were washed with 1 mL each of RIPA, RIPA + 1%
deoxycholate + 0.1% SDS, and PBS, resuspended in 50 µL
SDS-PAGE sample buffer, boiled, and subjected to SDS-PAGE
(7.5%). Labeled proteins were detected by phosphor imaging
(Molecular Dynamics). The results are presented in Figure 9.

6.9. Peptide Competition of GST-SH3 Affinity Precipit-
ations of PI-3' Kinase From Cell Lysates

It is possible to follow the precipitation of PI-3'
Kinase by Src from cell lysates in the presence or absence of
SH3-binding peptides. HeLa cells are lysed with detergent
and the protein mixtures are incubated for several hours with
the Src-GST fusion protein immobilized on glutathione-linked
Sepharose beads. After several hours of tumbling, the beads
are pelleted gently by low-speed centrifugation and the
supernatant is discarded. The beads are then resuspended
into a slurry in PBS-0.1% Tween 20, pelleted, and washed
several additional times. Finally, an SDS solution is added
to the sample, which is then boiled at 100 °C for 3 minutes.
Subsequently, the sample is centrifuged, and the supernatant
is loaded on a 10% polyacrylamide SDS gel for
electrophoresis. After the proteins have been resolved, the
gel is blotted to nitrocellulose or nylon (i.e., western
blot). The filter is then probed with a PI-3' Kinase
antibody (monoclonal and polyclonal antibodies are available
from Upstate Biotechnology Incorporated, Lake Placid, NY) and
an enzyme-linked secondary antibody. The amount of PI-3'
Kinase is then quantitated.

The ability of Src SH3 to bind PI-3' Kinase is examined
for competability with exogenous peptides. Synthetic
peptides corresponding to phage-displayed inserts and motifs
are added at the time that the lysate is incubated with the
Src-GST fusion protein that has been immobilized on
glutathione-linked Sepharose beads. Ten-fold and one
hundred-fold molar excess of peptides are used relative to
SH3 proteins. The SH3 binding peptides block binding of the
PI-3' Kinase as detected on western blots while negative
control peptides (unrelated peptide sequences) do not. The
amount of competition is quantified and correlated with the
amount of added SH3-domain binding peptides.

6.10. In Vivo Association Of SH3-Binding Peptides
With SH3-Domains Of Proteins

To demonstrate association of the SH3-binding
peptides with SH3-domains of proteins inside cells, the SH3-
binding peptides are tagged and localized in cells. For
example, Bar-Sagi et al., in Cell (1993) 74:83-91, have shown
that SH3-binding proteins localize to the cytoskeleton when
expressed in cells. Thus, the SH3 domain-binding peptides of
the present invention can serve as cellular targeting
signals (e.g., to the cytoskeleton). Accordingly, the
peptides are tagged with biotin and, subsequently, injected
into cells. Alternatively, one can transfect into cells a
recombinant plasmid that expresses a fusion protein
comprising the SH3-binding peptide and the green
fluorescent protein (GFP, Chalfie et al., in Science (1994)
263:802-805). The location of the biotinylated peptide or
the GFP fusion protein is then assayed with FITC-labeled
streptavidin in paraformaldehyde-fixed cells or by direct
fluorescence in living cells, respectively. Localization of
the SH3-binding peptides to the cytoskeleton demonstrates
that the SH3-binding peptides can bind SH3-domain proteins in
vivo. In addition, focal adhesions, which are rich in Src,
are also sites of potential subcellular localization of SH3-
binding peptides.

Thus, NIH 3T3 fibroblasts were cultured in vitro on
glass coverslips coated with fibronectin. After two days of
growth at 37 °C, the cells were fixed for one hour at room
temperature in the presence of 2% paraformaldehyde (pH 7.5).
The coverslips were washed with PBS-0.1% Tween 20 several
times to remove the fixative. Next, the coverslips were
dipped into acetone (chilled at -20 °C) for approximately 20
seconds and allowed to air-dry. The coverslips were washed
again with PBS-0.1% Tween 20, containing BSA (1 mg/mL), and
incubated for 2 hours at room temperature with different
biotinylated peptides in PBS-0.1% Tween 20. The coverslips
were washed and then incubated with 1 mg/mL streptavidin-Cy3
(Jackson Immunoresearch Co., West Grove, PA) for 1 hour at
room temperature. Finally, the coverslips were washed in
PBS-0.1% Tween 20, mounted in a glycerol solution on a glass
slide, and viewed with a Nikon Optiphot epifluorescence
microscope and a 60x oil immersion lens.

The results are presented in Figure 11, in which panel A
displays cells stained with the conjugate biotin-spacer-
VLKRPLPIPPVTR (SEQ ID NO:64); panel B exhibits cells stained
with the conjugate, biotin-spacer-GILAPPVPPRNTR (SEQ ID
NO:63); panel C shows cells stained with the long consensus
peptide, biotin-spacer-RSTPRPLPPLPTTR (SEQ ID NO:67); and
panel D shows cells stained with the proline-rich vinculin
peptide conjugate, biotin-spacer-LAPPKPPLPEGEV (SEQ ID
NO:70). The "spacer" sequence is KSGSG (SEQ ID NO:103). As
shown in Figure 11, the panels in which SH3 domain-binding
peptides were used present a bright display of fluorescence
activity that is in sharp contrast to the relatively "dark"
features of panel D (non-SH3 domain binding vinculin
segment). These results demonstrate further the ability of


the SH3 domain-binding peptides of the present invention to
localize to protein targets (e.g., Src and Src-related
proteins) within cells and provide an image thereof.

6.11. In Vivo Modulation Of Src In Oocytes With
SH3-Binding Peptides

When Xenopus laevis oocytes are injected with
mRNA encoding deregulated Src, there are dramatic cytological
and biochemical changes in the oocyte (Unger, T.F. and
Steele, R.E., in Mol. Cell. Biol. (1992) 12:5485-5498). The
applicants have obtained the plasmid for generating Src mRNA,
which is available from Dr. Robert Steele (University of
California at Irvine). Synthetic SH3-binding peptides are
injected into oocytes that have been previously injected with
Src mRNA. The state of the cytoskeleton is inspected
visually by observing the arrangement of cortical pigment
granules under a dissecting microscope. The state of
phosphorylation of several proteins is examined by western
blotting with an anti-phosphotyrosine monoclonal antibody
(4G10; Upstate Biotechnology Incorporated), as described in
Unger and Steele, above.

6.12. Progesterone-induced X. laevis Oocyte
Maturation

Segments of adult ovary were removed
surgically and incubated in 0.1% collagenase type D
(Boehringer Mannheim, Indianapolis, IN) in Ca2+-free OR2 (82.5
mM NaCl, 2.5 mM KCl, 1.0 mM MgCl2, 1.0 mM Na2HPO4, 5.0 mM
HEPES, and 3.8 mM NaOH, pH 7.6). Oocytes were then washed 3-
5 times with OR2 containing 1.0 mM CaCl2 and allowed to
recover in OR2 overnight at 18 °C. Stage VI oocytes were
injected with 40 nL of 100 mM peptide or water. After
injection, the oocytes were placed in OR2 with 2 mg/mL
progesterone (Sigma, St Louis, MO) and incubated at 20 °C.
Oocytes were scored at hourly time points for germinal
vesicle breakdown (GVBD).

Figure 10 presents the results of this experiment. As
shown by the graph, oocytes injected with the SH3 domain-
binding peptide VLKRPLPIPPVTR (SEQ ID NO:64) exhibit a faster
rate of progesterone-induced germinal vesicle breakdown
relative to oocytes that had been injected with water or with
the proline-rich vinculin peptide, LAPPKPPLPEGEV (SEQ ID
NO:70). These results parallel those obtained by Unger and
Steele, supra, who observed a gross alteration in the cortex
of oocytes that had been injected with RNA encoding a
deregulated Src variant ("active" Src) versus those injected
with RNA encoding the wild-type Src ("cellular" Src). Also,
as shown in Figure 3B of the Unger and Steele article,
oocytes injected with deregulated or active Src RNA matured
at a faster rate than oocytes injected with water or wild-
type Src RNA.

The present results obtained with Src SH3 domain-binding
peptides suggest that these peptides modulate the biochemical
activity of "cellular" Src; in particular, it is proposed
that at least some of the Src SH3 domain-binding peptides of
the present invention upregulate the biochemical activity of
"cellular" Src, which may be downregulated or inhibited in
its normal state. Hence, the administration of the SH3
domain-binding peptides of the present invention can
constitute a novel method of modulating the activity of Src
or Src-related proteins. Specifically, certain of these
peptides are able to activate Src-family proteins.

6.13. In Vivo Antagonism Of Src In Src Transformed
Cells With SH3-Binding Peptides

The coding regions for SH3-binding peptides
are cloned into vectors that direct their expression in
animal cells. A bipartite gene is constructed, encoding a
protein with c-myc epitope and SH3-binding peptide, which is
transcribed from a strong constitutive promoter (e.g., SV40,
CMV, HSV TK, calmodulin). The vector is introduced into
either normal or Src-transformed cells via transfection
(e.g., electroporation, calcium phosphate, liposomes, DEAE
dextran). Transfected cells express the bipartite gene
transiently in culture. To create stable transformed cell
lines, the vector carries a selectable marker (e.g., neomycin
resistance) or transfection is performed in the presence of
excess plasmid carrying a selectable marker (e.g., neomycin
resistance) and cells selected for the marker. Transfected
cells are stained by immunofluorescence to detect expression
of the bipartite protein. The hybridoma 9E10 secretes a
monoclonal antibody that is highly specific for the c-myc
epitope (EQKLISEEDLN [SEQ ID NO:105]; see, Evan, G.A. et al.,
in Mol. Cell. Biol. (1985) 5:3610-3616). This antibody is
used in immunofluorescence experiments to demonstrate that
the bipartite protein is expressed inside the cells, and in
some cases, localized to subcellular structures enriched in
SH3 domain bearing proteins.

There are several controls used in these experiments.
First, cells are transfected with vectors that do not have
the SH3-binding peptide coding region. Second, normal (non-
transformed) cells are transfected. Third, cells transformed
by oncogenes other than Src are used in the transfection
experiments. Fourth, cells are stained with other monoclonal
antibodies that do not recognize the c-myc epitope.

Transfected cells are examined for any changes in cell
shape, behavior, and metabolism as a consequence of
expressing the SH3 binding peptides. Cell shape is examined
by phase contrast microscope at several times after
transfection; in particular, the flatness of the cells, their
adhesion to the substrate, and the degree of cell ruffling
are monitored. Cell division rates, cell migration, and
contact inhibition are also observed over time. Finally, the
amount of phosphorylated tyrosine in transfected cells is
quantitated by phosphoaminoacid analysis and with an anti-
phosphotyrosine monoclonal antibody (4G10; Upstate
Biotechnology Incorporated) in western blotting experiments.

35 
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6.14 Pr parmtion f PXXP Bias d V ptid Llbrariaa 

The preparation and characterization of preferred phage-
displayed random peptide libraries have been described above
in Sections 6.1 - 6.4.

Using procedures similar to those described in these
sections, oligonucleotide inserts were constructed according
to the schematic provided in FIG. 12. The inserts were then
cloned into the mBAX vector, and the biased peptide libraries
were expressed as described previously.
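
The randomized stretches of the library oligonucleotides are written in the Sequence Listing with degenerate codons such as NNK and NNB. As a minimal sketch (the helper names `expand` and `summarize` are illustrative and not part of the specification, and the actual biased-library design is the one shown in FIG. 12, which is not reproduced here), the following Python enumerates the codons covered by such a scheme and reports which stop codons it can still produce:

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes (only the ones needed here).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "K": "GT", "M": "AC", "B": "CGT", "V": "ACG", "N": "ACGT",
}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def expand(codon):
    """Return every concrete codon matched by a degenerate codon such as 'NNK'."""
    return ["".join(bases) for bases in product(*(IUPAC[b] for b in codon))]


def summarize(codon):
    """Return (number of concrete codons, sorted list of stop codons among them)."""
    codons = expand(codon)
    stops = sorted(c for c in codons if c in STOP_CODONS)
    return len(codons), stops


for scheme in ("NNN", "NNK", "NNB"):
    total, stops = summarize(scheme)
    print(scheme, total, stops)
```

Under the standard IUPAC definitions, NNK and NNB each admit only TAG among the three stop codons, which is the codon that the text describes as suppressed in supE or supF hosts.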
The mBAX vector was created in the Kay Laboratory by
generating cloning sites in gene III of the M13mp18 vector
(Messing, J. (1991). Cloning in M13 phage or how to use
biology at its best. Gene 100, 3-12) in the manner of
Fowlkes et al. (1992) (BioTechniques 13, 422-427). The mBAX
vector displays a peptide sequence at the N-terminus of the
mature pIII protein that encodes the epitope for the mouse
monoclonal antibody 7E11 (see, FIG. 13); it includes the stop
codon TAG in the coding region, which is suppressed in E.
coli carrying suppressor tRNA gene mutations known as supE or
supF. There are no other stop codons in the mBAX genome.
The mBAX vector also carries a segment of the alpha fragment
of β-galactosidase; bacterial cells expressing the omega
fragment of β-galactosidase convert the clear XGal substrate
into an insoluble blue precipitate. The plaques appear blue.
Recombinant mBAX molecules can be distinguished easily
from non-recombinant molecules due to the TAG codon in the
Xho I - Xba I segment in gene III of mBAX. When recombinants
are generated by replacing the Xho I - Xba I fragment with
oligonucleotides encoding random peptides, the recombinants
can be grown in bacteria with (e.g., DH5αF') or without
(e.g., JS5) suppressor tRNA mutant genes. On the other hand,
the non-recombinant mBAX molecules fail to produce plaques on
bacterial lawns where the bacteria (e.g., JS5) lack such
suppressor genes. This is because in JS5, the TAG codon
serves as a stop codon to yield a truncated pIII molecule
during translation; since pIII is an essential protein
component of viable M13 viral particles, no plaques will
form.
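
The selection rule in the preceding paragraph can be restated as a small truth table. The sketch below is only an illustration: the function name and the host table are assumptions introduced here, and it assumes that a recombinant insert does not itself reintroduce a TAG codon.

```python
# Host strains named in the text and whether they carry a suppressor tRNA
# gene (supE or supF). The dictionary itself is illustrative.
HOST_HAS_SUPPRESSOR = {"DH5αF'": True, "JS5": False}


def forms_plaques(is_recombinant: bool, host: str) -> bool:
    """Return True if the phage is expected to form plaques on the given host."""
    # Non-recombinant mBAX keeps the TAG codon in the Xho I - Xba I segment.
    has_tag_codon = not is_recombinant
    if has_tag_codon and not HOST_HAS_SUPPRESSOR[host]:
        # TAG is read as a stop codon: truncated pIII, no viable phage.
        return False
    return True


for host in HOST_HAS_SUPPRESSOR:
    for is_recombinant in (True, False):
        print(host, "recombinant" if is_recombinant else "non-recombinant",
              forms_plaques(is_recombinant, host))
```

Only the combination of non-recombinant vector and non-suppressor host (JS5) is predicted to give no plaques, which is what makes plating on JS5 a selection for recombinants.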

6.14.1 Discussion 

The use of second generation or biased peptide
libraries, which fix all or part of the RPLPPLP consensus
motif for SH3 binding peptides and randomize flanking
residues, has defined additional sequence residues exhibiting
selective SH3 binding.

Tables 1-5, below, list the relevant amino acid
sequences obtained from the biased peptide library for each
set of SH3 domain binding peptides. The underscored amino
acid residues indicate the fixed positions. Also indicated
for each set of new binders is a "consensus" sequence, which
seeks to include the additional features gleaned from the new
binding peptides. The symbol "0" represents a hydrophobic
residue.
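
The PXXP motif named in the section title can be located in a peptide with a simple pattern search. This is a minimal sketch, not the method used to derive the consensus sequences; the function name `find_pxxp` is hypothetical, and the two example peptides are SEQ ID NO:9 and SEQ ID NO:5 from the Sequence Listing written in one-letter code.

```python
import re

# Lookahead so that overlapping PXXP windows are all reported.
PXXP = re.compile(r"(?=(P..P))")


def find_pxxp(peptide):
    """Return (1-based start position, matched window) for every PXXP occurrence."""
    return [(m.start() + 1, m.group(1)) for m in PXXP.finditer(peptide.upper())]


print(find_pxxp("RPLPPLP"))     # SEQ ID NO:9
print(find_pxxp("VPPPVPPRRR"))  # SEQ ID NO:5
```

Because proline-rich peptides often contain overlapping PXXP windows, the lookahead form reports each of them instead of only the first non-overlapping match.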



TABLE 1

CORTACTIN SH3-BINDING PEPTIDES

Clone                   Sequence                 SEQ. ID NO.
PXXP.CORT.M1/2/3.PP     SSLLGPPVEPKEQTLFSFSR     107
PXXP.CORT.M4.PP         SRLGEFSKPPIEQKPTWMSR     108
PXXP.CORT.N2.PP         SRTERPPLEQREDWLSYSSR     109
PXXP.CORT.N3.PP.INC     SREPDWLCPNCPLL.LRSDSR    110
PXXP.CORT.O1/2/3.PP     SSSSHNSRPPLPEKPSWLSR     111
PXXP.CORT.O4.PP         SRliTPQSKEPLEPKPSAVSR    112
CONSENSUS               KPP^PxKPxW               113
                        R



TABLE 2

NCK SH3-BINDING PEPTIDES

Clone                   Sequence                 SEQ. ID NO.
PXXP.NCK.Q1/4.PP        SSLGVGWK£LP£MRTASL££     114
PXXP.NCK.Q2/3.PP.INC    SSVGFADRERPELRVESLSB     115
PXXP.NCK.R1.PP.INC      SSAGILRPPEKPXRSFSLSE     116
PXXP.NCK.R2.PP          SSPYTGDVPIPPLRGASLSR     117
PXXP.NCK.R3.PP          SSIiMGSWPPVPPLRSDSLSB    118
PXXP.NCK.R4.PP          SSIGEDTPPSPPTRRASLSE     119
PXXP.NCK.S1/4.PP        SRSLSEVSEKPEIRSVSL5K     120
PXXP.NCK.S2.PP.INC      SSVSEGYSPPLPPRSTSLiSk    121
PXXP.NCK.S3.PP          SSSFTLAAPTPPTRSLSLSR     122
PXXP.NCK.T1.PP          SSPPYELPPRPPNRTVSLSE     123
PXXP.NCK.T2.PP          SRWDGLAPPPEVRLSSLSR      124
PXXP.NCK.T3.PP.INC      SSIiGYSGAPVPPHRxSSLSE    125
PXXP.NCK.T4.PP          SSISDYSREPPEVRTLSLSR     126
CONSENSUS               0xxxxxExPE0RSxSL         127
                        T



TABLE 3

ABL SH3-BINDING PEPTIDES

Clone                   Sequence                 SEQ. ID NO.
PXXP.ABL.G1/2.PP        SRGPRWSPPPVPLPTSLDSR     128
PXXP.ABL.G3/4.PP        SSPPDYAAPAIPSSLWVDSB     129
PXXP.ABL.H1/3/4.PP      SSPPHWAPEAPEAMSPPISE     130
PXXP.ABL.H2.PP.INC      SSDRCWECEPW£AGGQRGSB     131
PXXP.ABL.I1/2/3.PP      SSPPKFSPEPPEYWQLHASB     132
PXXP.ABL.I4.PP          SSPPSFAPEAAPPRHSFGSB     133
PXXP.ABL.J1.PP          SSAPKKPAEPVPMMAHVMSB     134
PXXP.ABL.J2.PP.INC      SSPTYPPPPPPfiTAKGASR     135
PXXP.ABL.J3.PP.INC      SSPPXXXPPPIPNSPQVLSfi    136
PXXP.ABL.J4.PP          SSPPTWTPPKPPGWGWFSR      137
PXXP.ABL.L1.PP          gSAPTWSPEALPNVAKYKSB     138
PXXP.ABL.L2/3.PP        SSIKGPRFEVPEVPLNGVSE     139
PXXP.ABL.L4.PP          SSPPAWSPPHREVAFGSTSR     140
CONSENSUS               PPxWxPPPi^P              141


TABLE 4

PLCG SH3-BINDING PEPTIDES

Clone                   Sequence                 SEQ. ID NO.
PXXP.PLCG.P1.PP         SSMKVHNFELP£LPSYETSE     142
PXXP.PLCG.P2.PP         SBVPPLVAERPESTLNSLSB     143
PXXP.PLCG.P3.PP.INC     SSLYWQHGEDPEVGAPQLSE     144
PXXP.PLCG.P4.PP         SSHPLNSWEGGEFRHNLSSB     145



TABLE 5

SRC SH3-BINDING PEPTIDES

Clone                   Sequence                 SEQ. ID NO.
PXXP.SRC.A1.PP          SSRALRVR£LPEVPGTSLSE     146
PXXP.SRC.A2.PP          SSFRAIiPLEPTEDNPFAGSB    147
PXXP.SRC.A3.PP          SBDAPGSLEFRELPPVPTSB     148
PXXP.SRC.A4.PP          SSISQRALEPLELMSDPASR     149
PXXP.SRC.B1.PP          SSPAYRPLERLEDLSVIYSR     150
PXXP.SRC.B2/3.PP        SSFINRRLEALEPDNSLLSE     151
PXXP.SRC.B4.PP          SRLTGRPLPALEPPFSDFSB     152
PXXP.SRC.C1.PP          SRMKDRVLPPI£TVESAVSfi    153
PXXP.SRC.C2.PP.INC      5SLYSAIAPDPEPRNSSSSE     154
PXXP.SRC.C3.PP          SSLASRPLELLPNSAPGQSB     155
PXXP.SRC.D1.PP          SSLTSRPLEDlEVRPSKSSB     156
PXXP.SRC.D2.PP.INC      SSLKWRALPPLPETDTPYgB     157
PXXP.SRC.D3.PP          SSNTNRLPEPTEDGLDVRSB     158
PXXP.SRC.D4.PP          gSLQSRPLELPEQSSYPISR     159
CONSENSUS               RPLPPIiP                 160



It should be apparent to one of ordinary skill that many 
other embodiments of the present invention can be 
contemplated beyond the preferred embodiments described above 
but which other embodiments nevertheless fall within the 
scope and spirit of the present invention. Hence, the 
present invention should not be construed to be limited to 
the preferred embodiments described herein, which serve only 
to illustrate the present invention, but only by the claims 
that follow. 

Also, numerous references are cited throughout the 
specification. The complete disclosures of these references 
are incorporated by reference herein. 



SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: SPARKS, ANDREW B.
KAY, BRIAN K.
THORN, JUDITH M.
QUILLIAM, LAWRENCE A.
DER, CHANNING J.

(ii) TITLE OF INVENTION: Src SH3 BINDING PEPTIDES AND METHODS OF
ISOLATING AND USING SAME

(iii) NUMBER OF SEQUENCES: 106

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Pennie & Edmonds
(B) STREET: 1155 Avenue of the Americas
(C) CITY: New York
(D) STATE: New York
(E) COUNTRY: U.S.A.
(F) ZIP: 10036-2711

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: S. Leslie Misrock
(B) REGISTRATION NUMBER: 18,872
(C) REFERENCE/DOCKET NUMBER: 1101-192

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (212) 790-9090
(B) TELEFAX: (212) 869-9741/8864
(C) TELEX: 66141 PENNIE

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 8
(D) OTHER INFORMATION: /note= "hydrophobic residue (such
as Pro or Leu)"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

Xaa Pro Xaa Xaa Pro Pro Pro Xaa Xaa Pro
1               5                   10

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 9 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 1..3
(D) OTHER INFORMATION: /note= "Xaa = any residue other
than Cys"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 6
(D) OTHER INFORMATION: /note= "X = any residue other
than Cys"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 8..9
(D) OTHER INFORMATION: /note= "X = any residue other
than Cys"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Xaa Xaa Xaa Pro Pro Xaa Pro Xaa Xaa
1               5

(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 9 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 2
(D) OTHER INFORMATION: /note= "X tends to be hydrophobic"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 8..9
(D) OTHER INFORMATION: /note= "X tends to be hydrophobic"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

Arg Xaa Leu Pro Pro Arg Pro Xaa Xaa
1               5



(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 9 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

Arg Lys Leu Pro Pro Arg Pro Arg Arg
1               5

(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

Val Pro Pro Pro Val Pro Pro Arg Arg Arg
1               5                   10



(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 2
(D) OTHER INFORMATION: /note= "tends to be Pro"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 3
(D) OTHER INFORMATION: /note= "hydrophobic residue"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 5
(D) OTHER INFORMATION: /note= "tends to be Pro"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

Xaa Xaa Xaa Pro Xaa Xaa Pro
1               5



(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 9 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 2
(D) OTHER INFORMATION: /note= "X = any residue other
than Cys"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 8..9
(D) OTHER INFORMATION: /note= "X = any residue other
than Cys"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

Arg Xaa Leu Pro Pro Arg Pro Xaa Xaa
1               5

(2) INFORMATION FOR SEQ ID NO:8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 9 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 9
(D) OTHER INFORMATION: /note= "hydrophobic residue"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

Arg Xaa Leu Pro Pro Leu Pro Arg Phe
1               5

(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:

Arg Pro Leu Pro Pro Leu Pro
1               5

(2) INFORMATION FOR SEQ ID NO:10:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 9 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 2
(D) OTHER INFORMATION: /note= "any residue other than Cys"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 5..6
(D) OTHER INFORMATION: /note= "any hydrophobic residue"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 9
(D) OTHER INFORMATION: /note= "any hydrophilic residue
except Cys"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:

Arg Xaa Leu Pro Xaa Xaa Pro Xaa Xaa
1               5


(2) INFORMATION FOR SEQ ID NO:11:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 9 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:

Arg Pro Leu Pro Pro Leu Pro Thr Ser
1               5
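
The listing gives residues in three-letter code, whereas Tables 1-5 use one-letter code. A minimal conversion sketch follows; the mapping is the standard amino acid code, the function name `to_one_letter` is an assumption introduced here, and unknown tokens raise a `KeyError`, which also helps flag garbled residue names.

```python
# Standard three-letter to one-letter amino acid codes; Xaa marks an
# unspecified residue and is written as X.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",
}


def to_one_letter(sequence):
    """Convert a whitespace-separated three-letter sequence to one-letter code."""
    # capitalize() normalizes case, so "ser" and "SER" are both read as "Ser".
    return "".join(THREE_TO_ONE[token.capitalize()] for token in sequence.split())


print(to_one_letter("Arg Pro Leu Pro Pro Leu Pro Thr Ser"))  # SEQ ID NO:11
```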


(2) INFORMATION FOR SEQ ID NO:12:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:

Xaa Xaa Xaa Xaa
1



(2) INFORMATION FOR SEQ ID NO:13:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 5 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:

Xaa Xaa Xaa Xaa Xaa
1               5



(2) INFORMATION FOR SEQ ID NO:14:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:

Xaa Xaa Xaa Xaa
1

(2) INFORMATION FOR SEQ ID NO:15:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 1..3
(D) OTHER INFORMATION: /note= "any residue other than Cys"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 5
(D) OTHER INFORMATION: /note= "any residue other than Cys"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 8..9
(D) OTHER INFORMATION: /note= "any hydrophobic residue"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 11..13
(D) OTHER INFORMATION: /note= "any residue other than Cys"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:

Xaa Xaa Xaa Arg Xaa Leu Pro Xaa Xaa Pro Xaa Xaa Xaa
1               5                   10



(2) INFORMATION FOR SEQ ID NO:16:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 42 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:

Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
1               5                   10                  15

Xaa Xaa Xaa Xaa Pro Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
            20                  25                  30

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Arg
        35                  40



(2) INFORMATION FOR SEQ ID NO:17:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 69 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:

CGCTCCAGNN NBNNBNNBNN BNNBNNBNNB NNBNNBNNBN NBNNBNNBNN BNNBNNBNNB
NNBCCAGGT

(2) INFORMATION FOR SEQ ID NO:18:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 68 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:

GGTCCANNVN NVNNVNNVNN VNNVNNVNNV NNVNNVNNVN NVNNVNNVNN VNNVNNVNNV
AGATCTGG

(2) INFORMATION FOR SEQ ID NO:19:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 66 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:

TCGAGNNNBN NBNNBNNBNN BNNBNNBNNB NNBNNBNNBN NBNNBNNBNN BNNBNNBNNB
CCAGGT

(2) INFORMATION FOR SEQ ID NO:20:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 65 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:

GGTCCANNVN NVNNVNNVNN VNNVNNVNNV NNVNNVNNVN NVNNVNNVNN VNNVNNVNNV 60
AGATC

(2) INFORMATION FOR SEQ ID NO:21:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 46 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(v) FRAGMENT TYPE:

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 2
(D) OTHER INFORMATION: /note= "Arg or Ser"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:

Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
1               5                   10                  15

Xaa Xaa Xaa Xaa Pro Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
            20                  25                  30

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Arg Pro Ser Arg Thr
        35                  40                  45

(2) INFORMATION FOR SEQ ID NO:22:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 49 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:

Ser His Ser Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
1               5                   10                  15

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa
            20                  25                  30

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Arg Pro Ser Arg
        35                  40                  45

Thr

(2) INFORMATION FOR SEQ ID NO:23:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 2
(D) OTHER INFORMATION: /note= "Ser or Thr"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 13
(D) OTHER INFORMATION: /note= "Ser, Arg, Gly, Cys or Trp"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 15
(D) OTHER INFORMATION: /note= "Val, Ala, Asp, Glu or Gly"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:

Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa
1               5                   10                  15

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Arg
            20                  25

(2) INFORMATION FOR SEQ ID NO:24:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 48 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:

TTTTGTCGAC NNNBNNBNNB NNBNNBNNBN NBNNBNNBNN BNGCGGTG 48

(2) INFORMATION FOR SEQ ID NO:25:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 48 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:

CGCCACNVNN VNNVNNVNNV NNVNNVNNVN NVNNVNNVTG ATCATTTT 48

(2) INFORMATION FOR SEQ ID NO:26:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 43 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:

CGCCACNVNN VNNVNNVNNV NNVNNVNNVN NVNNVNNVTG ATC 43

(2) INFORMATION FOR SEQ ID NO:27:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 43 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:

TCGACNNNBN NBNNBNNBNN BNNBNNBNNB NNBNNBNGCG GTG



(2) INFORMATION FOR SEQ ID NO:28:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(v) FRAGMENT TYPE:

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 2
(D) OTHER INFORMATION: /note= "Ser or Thr"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 13
(D) OTHER INFORMATION: /note= "Ser, Arg, Gly, Cys or Trp"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 15
(D) OTHER INFORMATION: /note= "Val, Ala, Asp, Glu or Gly"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:

Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa
1               5                   10                  15

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Arg Pro Ser Arg Thr
            20                  25                  30



(2) INFORMATION FOR SEQ ID NO:29:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 5
(D) OTHER INFORMATION: /note= "Ser or Thr"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 16
(D) OTHER INFORMATION: /note= "Ser, Arg, Gly, Cys or Trp"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 18
(D) OTHER INFORMATION: /note= "Val, Ala, Asp, Glu or Gly"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:

Ser His Ser Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
1               5                   10                  15

Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Arg Pro Ser
            20                  25                  30

Arg Thr



(2) INFORMATION FOR SEQ ID NO:30:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 8 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
1               5



(2) INFORMATION FOR SEQ ID NO:31:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 58 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:

TGACG^iCTCG AGTTGTNNKN NKNNKNNKNN KNNKNNKNNK TGTGGATCTA GAAGGATC 


(2) INFORMATION FOR SEQ ID NO:32:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:

CCTAGATCTT CCTAG

(2) INFORMATION FOR SEQ ID NO:33:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:

TCGAGTTGTN NKNNKNNKNN KNNKNNKNNK NNKTGTGGAT 40

(2) INFORMATION FOR SEQ ID NO:34:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:

CAACANNMNN MNNMNNMNNM NNMNNMNNMA CACCTAGATC 40

(2) INFORMATION FOR SEQ ID NO:35:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(v) FRAGMENT TYPE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35:

Ser Ser Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Gly Ser Arg Pro
1               5                   10                  15

Ser Arg Thr



(2) INFORMATION FOR SEQ ID NO:36:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 28 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36:

Ser Ser Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Gly Ser Arg Ser
1               5                   10                  15

Thr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Thr Arg
            20                  25

(2) INFORMATION FOR SEQ ID NO:37:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 80 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37:

TCGAGTTGTN NKNNKNNKNN KNNKNNKNNK NNKTGTGGAT CTAGATCCAC AVNNVNNVNN 60
VNNVNNVNNV NNVNNACAAC 80

(2) INFORMATION FOR SEQ ID NO:38:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 45 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38:

Ser Ser Phe Asp Gln Gln Asp Trp Asp Tyr Ser Ile Ala Glu Lys Met
1               5                   10                  15

His Pro Ile Arg Pro Gly Phe Arg Glu Leu Pro Pro Leu Pro Pro Ser
            20                  25                  30

Arg Ala Ser Phe Gly Gly Gly Ala Ser Arg Pro Ser Arg
        35                  40                  45

(2) INFORMATION FOR SEQ ID NO:39:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39:

Ser Thr Asn Val Trp Val Thr Gly Ser Val Ile Ala Arg Gly Ala Gln
1               5                   10                  15

Ser Arg Pro Leu Pro Ile Pro Pro Glu Thr Arg Pro Ser Arg
            20                  25                  30

(2) INFORMATION FOR SEQ ID NO:40:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40:

Ser Thr Ala Pro Trp Gly Leu Arg Val Ala His Glu Gly Gly Val Leu
1               5                   10                  15

Lys Arg Pro Leu Pro Ile Pro Pro Val Thr Arg Pro Ser Arg
            20                  25                  30

(2) INFORMATION FOR SEQ ID NO:41:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 45 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41:

Ser Ser Ser Gly Tyr Val Val Pro Lys Arg Leu Gly Asp Met Arg Glu
1               5                   10                  15

Tyr Asn Ala His Pro Gly Leu His Val Pro Pro Asn Ser Pro Leu Pro
            20                  25                  30

Pro Leu Pro Thr His Leu Gln Ser Ser Arg Pro Ser Arg
        35                  40                  45

(2) INFORMATION FOR SEQ ID NO:42:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 44 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42:

Ser Ser Arg Gly Glu Gly Asn Asn Ile Ile Ser Ser Arg Pro Phe Leu
1               5                   10                  15

Ser Asn Ser Asp Pro Gly Val Ser Asn Lys Leu Thr Gly Arg Gly Pro
            20                  25                  30

Leu Pro Pro Leu Pro Asn Asp Ser Arg Pro Ser Arg
        35                  40

(2) INFORMATION FOR SEQ ID NO:43:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43:

Ser Thr Ala Val Ser Phe Arg Phe Met Pro Gly Gly Gly Gly Ala Phe
1               5                   10                  15

Tyr Ser Thr Arg Pro Val Pro Pro Ile Thr Arg Pro Ser Arg Thr
            20                  25                  30



(2) INFORMATION FOR SEQ ID NO:44:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44:

Ser Thr Ala His Ser Leu Trp Asp Trp Gly Thr Phe Ser Gly Val Ser
1               5                   10                  15

His Lys Ser Arg Leu Pro Pro Leu Pro Thr Arg Pro Ser Arg Thr
            20                  25                  30

(2) INFORMATION FOR SEQ ID NO:45:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45:

Pro Gly Tyr Ala Arg Ile Val Ser Tyr Arg Phe Arg Ala Leu Pro Ser
1               5                   10                  15

Pro Pro Ser Ala Ser Arg Pro Ser Arg
            20                  25

(2) INFORMATION FOR SEQ ID NO:46:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46:

Ser Thr Asn Asp Val Asp Trp Met His Met Trp Asn Ser Gly Gly Pro
1               5                   10                  15

His Arg Arg Leu Pro Pro Thr Pro Ala Thr Arg Pro Ser Arg
            20                  25                  30

(2) INFORMATION FOR SEQ ID NO:47:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 45 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47:

Ser Ser Asp Asn Trp Ala Arg Arg Val His Ala Ser Glu Leu Ile Tyr
1               5                   10                  15

Thr Asp Leu Ser Pro Gly Ile Leu Leu Ala Gln Arg Gln Leu Pro Pro
            20                  25                  30

Thr Pro Gly Arg Asp Pro Ser His Ser Arg Pro Ser Arg
        35                  40                  45

(2) INFORMATION FOR SEQ ID NO:48:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 45 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48:

Ser Ser Glu Ser Pro Leu Met Tyr Asn Arg Val Gly Ala Leu Gln Ser
1               5                   10                  15

Leu Thr Ser Val Pro Gly Ser Met Met His Phe Ala Leu Gln Arg Arg
            20                  25                  30

Leu Pro Arg Thr Pro Pro Pro Ala Ser Arg Pro Ser Arg
        35                  40                  45

(2) INFORMATION FOR SEQ ID NO:49:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49:

Ser Thr Arg Trp Ser His Ser Trp Pro Gly Tyr Val Gly Gly Ala Asn
1               5                   10                  15

Pro Ser Pro Ala Thr Arg Pro Leu Pro Thr Arg Pro Ser Arg Thr Val
            20                  25                  30

Glu Ser Cys
        35

(2) INFORMATION FOR SEQ ID NO:50:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 50 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50:

Ser Arg Tyr Asn Asp Leu Gly Thr Arg Pro Val Ser Glu Val Ile Lys
1               5                   10                  15

Tyr Asp Tyr Phe Pro Gly Tyr Ser Gln His Val Ile Thr Pro Asp Gly
            20                  25                  30

Ser Tyr Ser Thr Arg Pro Leu Pro Ser Arg Pro Ser Arg Thr Val Glu
        35                  40                  45

Ser Cys
    50

(2) INFORMATION FOR SEQ ID NO:51:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51:

Pro Gly Arg Leu Leu Leu Pro Ser Glu Pro Arg Thr Phe Tyr Asn Tyr
1               5                   10                  15

Gly His Asp Ser Arg Pro Ser Arg
            20

(2) INFORMATION FOR SEQ ID NO:52:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52:

Ser Thr Met Tyr Gly Val Ser Trp Leu Ser Ser Gly Ser Gly Gly Ile
1               5                   10                  15

Leu Ala Pro Pro Val Pro Pro Arg Asn Thr Arg Pro Ser Arg
            20                  25                  30

(2) INFORMATION FOR SEQ ID NO:53:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53:

Ser Ser Cys Thr Glu Lys Thr Val Ser Gly Trp Cys Gly Ser Arg Ser
1               5                   10                  15

Thr Arg Pro Leu Pro Ile Leu Pro Arg Thr Thr Arg Pro Ser Arg
            20                  25                  30



(2) INFORMATION FOR SEQ ID NO:54:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54:

Ser Ser Cys Met Leu Pro Thr Asp Gly Trp Gln Cys Gly Ser Arg Ser
1               5                   10                  15

Thr Pro Arg Pro Leu Pro Met Leu Pro Thr Thr Arg Pro Ser Arg
            20                  25                  30



(2) INFORMATION FOR SEQ ID NO:55:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55:

Ser Ser Cys Asp Gly Thr Gln Phe Arg Leu Asn Cys Gly Ser Arg Ser
1               5                   10                  15

Thr Asn Arg Pro Leu Pro Met Ile Pro Thr Thr Arg Pro Ser Arg
            20                  25                  30



(2) INFORMATION FOR SEQ ID NO:56:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56:

Ser Ser Cys Met Gln Gly Gln Ala Gly Leu Lys Cys Gly Ser Arg Ser
1               5                   10                  15

Thr Arg Pro Leu Pro Ser Leu Pro Ile Thr Thr Arg Pro Ser Arg
            20                  25                  30

(2) INFORMATION FOR SEQ ID NO:57:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57:

Ser Ser Cys Tyr Arg Glu Lys Asp Thr Trp Gly Cys Gly Ser Arg Ser
1               5                   10                  15

Thr Ser Arg Pro Leu Pro Ser Leu Pro Thr Thr Arg Pro Ser Arg
            20                  25                  30

(2) INFORMATION FOR SEQ ID NO: 58:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:

Ser Ser Cys Leu Phe Glu Gln Gly Ala Gly Thr Cys Gly Ser Arg Ser
1 5 10 15

Thr Arg Ser Leu Pro Pro Leu Pro Pro Thr Thr Arg Pro Ser Ser Arg 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 59:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 44 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:

Ser Ser Cys Asp Thr Gly Arg Ile Ala Pro Gly Cys Gly Ser Arg Ser
1 5 10 15

Thr Pro Arg Pro Leu Pro Leu Ile Pro Thr Thr Pro Arg Ser Thr Asn
20 25 30

Leu Asn Leu Thr Ser Thr Thr Thr Arg Pro Ser Arg
35 40



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60:

Ser Ser Cys Gly Leu Asp Asn Ala Ala Lys Thr Cys Gly Ser Arg Ser
1 5 10 15

Thr Arg Pro Leu Pro Pro Thr Pro Leu Thr Thr Arg Pro Ser Arg
20 25 30

(2) INFORMATION FOR SEQ ID NO: 61:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61:

Ser Ser Cys Ser Arg Ala His Glu Thr Glu Met Cys Gly Ser Arg Ser
1 5 10 15

Thr Arg Pro Gln Pro Pro Pro Pro Ile Thr Thr Arg Pro Ser Arg
20 25 30

(2) INFORMATION FOR SEQ ID NO: 62: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62:

Arg Ser Thr Pro Arg Pro Leu Pro Met Leu Pro Thr Thr Arg
1 5 10
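
The listing gives each peptide in three-letter residue codes, while the claims cite the same peptides in one-letter codes. As a minimal sketch (the helper name `to_one_letter` is illustrative and not part of the application), the following Python converts a listing entry so the two notations can be compared. It assumes the standard IUPAC codes, with Xaa mapped to X.

```python
# Standard IUPAC three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",  # unspecified residue
}


def to_one_letter(three_letter_seq):
    """Convert a whitespace-separated three-letter sequence to one-letter codes."""
    return "".join(THREE_TO_ONE[tok.capitalize()] for tok in three_letter_seq.split())


# SEQ ID NO: 62 as printed in the listing above.
seq_62 = "Arg Ser Thr Pro Arg Pro Leu Pro Met Leu Pro Thr Thr Arg"
print(to_one_letter(seq_62))   # RSTPRPLPMLPTTR
print(len(seq_62.split()))     # 14, matching the stated LENGTH
```

An unknown token raises `KeyError`, which is a convenient way to spot a residue code damaged in transcription.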

(2) INFORMATION FOR SEQ ID NO: 63:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63:

Gly Ile Leu Ala Pro Pro Val Pro Pro Arg Asn Thr Arg
1 5 10



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64:

Val Leu Lys Arg Pro Leu Pro Ile Pro Pro Val Thr Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO: 65:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65:

Gly Pro His Arg Arg Leu Pro Pro Thr Pro Ala Thr Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66:

Ala Asn Pro Ser Pro Ala Thr Arg Pro Leu Pro Thr Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67:

Arg Ser Thr Pro Arg Pro Leu Pro Pro Leu Pro Thr Thr Arg
1 5 10



(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68:

Gly Thr Val Glu Pro Val Pro Pro Pro Val Pro Pro Arg Arg Arg Pro
1 5 10 15

Glu Ser Ala 

(2) INFORMATION FOR SEQ ID NO: 69:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69:

Lys Ile Ser Pro Pro Thr Pro Lys Pro Arg Pro Pro Arg Pro Leu Pro
1 5 10 15

Val

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70:

Leu Ala Pro Pro Lys Pro Pro Leu Pro Glu Gly Glu Val
1 5 10

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 2
(D) OTHER INFORMATION: /note= "any residue other than Cys"

(ix) FEATURE:
(A) NAME/KEY: Modified-site

(B) LOCATION: 5..6

(D) OTHER INFORMATION: /note= "any hydrophobic residue"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:

Arg Xaa Leu Pro Xaa Xaa Pro
1 5



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72:

Pro Gly Phe Arg Glu Leu Pro Pro Leu Pro Pro Ser Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73:

Ala Gln Ser Arg Pro Leu Pro Ile Pro Pro Glu Thr Arg
1 5 10



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74:

Pro Pro Asn Ser Pro Leu Pro Pro Leu Pro Thr His Leu
1 5 10



(2) INFORMATION FOR SEQ ID NO: 75:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75:

Thr Gly Arg Gly Pro Leu Pro Pro Leu Pro Asn Asp Ser
1 5 10

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76:

Tyr Ser Thr Arg Pro Val Pro Pro Ile Thr Arg Pro Ser
1 5 10



(2) INFORMATION FOR SEQ ID NO: 77:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77:

Ser His Lys Ser Arg Leu Pro Pro Leu Pro Thr Arg Pro
1 5 10



(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Tyr Arg Phe Arg Ala Leu Pro Ser Pro Pro Ser Ala Ser 
1 5 10

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79:

Leu Ala Gln Arg Gln Leu Pro Pro Thr Pro Gly Arg Asp
1 5 10

(2) INFORMATION FOR SEQ ID NO: 80:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80:

Ala Leu Gln Arg Arg Leu Pro Arg Thr Pro Pro Pro Ala
1 5 10

(2) INFORMATION FOR SEQ ID NO: 81:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81:

Pro Ala Thr Arg Pro Leu Pro Thr Arg Pro Ser Arg Thr
1 5 10



(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82:

Tyr Ser Thr Arg Pro Leu Pro Ser Arg Pro Ser Arg Thr
1 5 10



(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

Xaa Pro Gly Arg Ile Leu Leu Leu Pro Ser Glu Pro Arg



(2) INFORMATION FOR SEQ ID NO: 84:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84:

Ser Gly Gly Ile Leu Ala Pro Pro Val Pro Pro Arg Asn
1 5 10



(2) INFORMATION FOR SEQ ID NO: 85:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85:

Arg Ser Thr Arg Pro Leu Pro Ile Leu Pro Arg Thr Thr
1 5 10

(2) INFORMATION FOR SEQ ID NO: 86:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86:

Ser Thr Pro Arg Pro Leu Pro Met Leu Pro Thr Thr Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO: 87:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87:

Ser Thr Asn Arg Pro Leu Pro Met Ile Pro Thr Thr Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO: 88:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88:

Arg Ser Thr Arg Pro Leu Pro Ser Leu Pro Ile Thr Thr
1 5 10



(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89:

Ser Thr Ser Arg Pro Leu Pro Ser Leu Pro Thr Thr Arg
1 5 10



(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90:

Arg Ser Thr Arg Pro Leu Pro Ser Leu Pro Ile Thr Thr
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91:

Arg Ser Thr Arg Gln Leu Pro Ile Pro Pro Thr Thr Thr
1 5 10



(2) INFORMATION FOR SEQ ID NO:92: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92:

Ser Thr Pro Arg Pro Leu Pro Leu Ile Pro Thr Thr Pro
1 5 10

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93:

Arg Ser Thr Arg Pro Leu Pro Pro Thr Pro Leu Thr Thr
1 5 10

(2) INFORMATION FOR SEQ ID NO:94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94:

Arg Ser Thr Arg Pro Gln Pro Pro Pro Pro Ile Thr Thr
1 5 10



(2) INFORMATION FOR SEQ ID NO: 95:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:95: 

Arg Pro Leu Pro Met Leu Pro 
1 5 

(2) INFORMATION FOR SEQ ID NO: 96:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 5

(D) OTHER INFORMATION: /note= "Pro or Met"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96:

Arg Pro Leu Pro Xaa Leu Pro
1 5
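
An entry with an Xaa position and a feature note stands for a small family of concrete peptides. The sketch below, with the hypothetical helper `expand_xaa`, enumerates that family from a one-letter template in which X marks each variable position. The allowed residues for SEQ ID NO: 96 come from its feature note ("Pro or Met"); nothing else is assumed.

```python
from itertools import product


def expand_xaa(template, allowed):
    """Enumerate concrete peptides for a one-letter template.

    template: one-letter sequence with 'X' at each variable position.
    allowed:  one string of permitted residues per 'X', in order.
    """
    parts = template.split("X")
    if len(parts) - 1 != len(allowed):
        raise ValueError("need one residue set per X in the template")
    peptides = []
    for combo in product(*allowed):
        pieces = [parts[0]]
        for residue, tail in zip(combo, parts[1:]):
            pieces.append(residue + tail)
        peptides.append("".join(pieces))
    return peptides


# SEQ ID NO: 96: Arg Pro Leu Pro Xaa Leu Pro, position 5 is Pro or Met.
print(expand_xaa("RPLPXLP", ["PM"]))   # ['RPLPPLP', 'RPLPMLP']
```

The two results are the heptamers RPLPPLP and RPLPMLP, the latter being the sequence listed as SEQ ID NO: 95.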

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97:

Arg Ser Thr Pro
1

(2) INFORMATION FOR SEQ ID NO: 98:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98:

Arg Ser Thr Pro Ala Pro Pro Val Pro Pro Arg Thr Thr Arg 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99:
AGCGTAACGA TCTCCOG

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
TTCACCTCGA AAGCAAGCTG 

(2) INFORMATION FOR SEQ ID NO: 101:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101:

CCTCATAGTT AGCGTAACG

(2) INFORMATION FOR SEQ ID NO: 102:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
AGCGTAACGA TCTAAA 



(2) INFORMATION FOR SEQ ID NO: 103:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103:

Lys Ser Gly Ser Gly
1 5



(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104:

Gly Ser Gly Ser
1



(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105:

Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Asn
1 5 10

(2) INFORMATION FOR SEQ ID NO: 106:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:

Ser Ser Cys Asp His Thr Leu Gly Leu Gly Trp Cys Gly Ser Arg Ser
1 5 10 15

Thr Arg Gln Leu Pro Ile Pro Pro Thr Thr Thr Arg Pro Ser Arg
20 25 30



WHAT IS CLAIMED IS:

1. A peptide having at least nine and up to forty-five
amino acid residues, including an amino acid sequence of the
formula, R-2-L-P-5-6-P-8-9 (SEQ ID NO: 10), positioned
anywhere along the peptide, in which each number represents
an amino acid residue, such that 2 represents any amino acid
residue except cysteine, 5 and 6 each represents a
hydrophobic amino acid residue, 8 represents any amino acid
residue except cysteine, and 9 represents a hydrophilic amino
acid residue except cysteine, each letter being the standard
one-letter symbol for the corresponding amino acid, said
peptide exhibiting a binding affinity for the SH3 domain of
Src, provided that said peptide is not R-P-L-P-P-L-P-T-S (SEQ
ID NO: 11).

2. The peptide of claim 1 in which 2 is a P, R, A, L,
Q, E or S.

3. The peptide of claim 1 in which 5 is a P, M, I or
L.

4. The peptide of claim 1 in which 6 is a P, L, I or
V.

5. The peptide of claim 1 in which 8 is a T, R, P, I,
N, E, V, S, A, G or L.

6. The peptide of claim 1 in which 9 is a T, R, S, H
or D.
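
The formula of claim 1, narrowed by the residue choices of claims 2 to 6, can be written as a pattern over one-letter codes. The following is a minimal sketch only: the names `MOTIF` and `find_core_motif` are illustrative, the residue sets are the preferred residues of the dependent claims rather than the broader classes of claim 1, and the residue "I" at position 8 is a reading of a character that is unclear in the source scan.

```python
import re

# Preferred residues per variable position (claims 2 to 6).
POS2 = "PRALQES"
POS5 = "PMIL"
POS6 = "PLIV"
POS8 = "TRPINEVSAGL"   # "I" is an assumed reading of an unclear character
POS9 = "TRSHD"

# R-2-L-P-5-6-P-8-9 as a regular expression.
MOTIF = re.compile(f"R[{POS2}]LP[{POS5}][{POS6}]P[{POS8}][{POS9}]")

EXCLUDED = "RPLPPLPTS"  # SEQ ID NO: 11, excluded by the proviso of claim 1


def find_core_motif(peptide):
    """Return the 0-based start of the first 9-mer match, or None."""
    if not 9 <= len(peptide) <= 45 or peptide == EXCLUDED:
        return None
    match = MOTIF.search(peptide)
    return match.start() if match else None


print(find_core_motif("RSTPRPLPMLPTTR"))   # 4  (SEQ ID NO: 62)
print(find_core_motif("RPLPPLPTS"))        # None (excluded by the proviso)
```

A pattern match says nothing about binding affinity for the Src SH3 domain, which the claim also requires; that has to be established experimentally.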

7. The peptide of claim 1 which further comprises a
C-terminal-flanking amino acid sequence of the formula 10,
10-11, 10-11-12, 10-11-12-13 (SEQ ID NO: 12) or 10-11-12-13-14
(SEQ ID NO: 13), in which each number represents any amino
acid residue except cysteine, such that 10 is bound to 9 by a
peptide bond.

8. The peptide of claim 7 in which 10 is T, R, L, S,
D, P, A or N.

9. The peptide of claim 7 in which 11 is R, P, A, Q, S
or T.

10. The peptide of claim 7 in which 12 is P, S, R or T.

11. The peptide of claim 7 in which 13 is P, S, R, F, H
or T.

12. The peptide of claim 7 in which 14 is S, R, G or

13. The peptide of claim 1 which further comprises an
N-terminal-flanking amino acid sequence of the formula 1',
2'-1', 3'-2'-1' or 4'-3'-2'-1' (SEQ ID NO: 14) in which each
number represents any amino acid residue except cysteine,
such that 1' is bound to R by a peptide bond.

14. The peptide of claim 13 in which 1' is T, S, N,
F, K, H, Q or G.

15. The peptide of claim 13 in which 2' is S, T, G, P,
R, Q, A or H.

16. The peptide of claim 13 in which 3' is R, S, P, G,
A, V, Y or L.

17. The peptide of claim 13 in which 4' is R, S, V, T,
G, L or F.

18. A peptide having at least thirteen and up to forty-five
amino acid residues, including an amino acid sequence of
the formula, 3'-2'-1'-R-2-L-P-5-6-P-8-9-10 (SEQ ID NO: 15),
positioned anywhere along the peptide, in which each number
represents an amino acid residue, such that 3', 2', 1', 2, 8,
and 10 each represents any amino acid residue except
cysteine, 5 and 6 each represents a hydrophobic amino acid
residue, and 9 represents a hydrophilic amino acid residue
except cysteine, each letter being the standard one-letter
symbol for the corresponding amino acid, said peptide
exhibiting a binding affinity for the SH3 domain of Src.

19. The peptide of claim 18 in which 5 is a P or M.

20. The peptide of claim 18 in which 1' is T, P, S or
N.

21. The peptide of claim 18 in which 2' is S or T.

22. The peptide of claim 18 in which 3' is R or S.

23. The peptide of claim 18 in which 10 is T or R.

24. The peptide of claim 1 the binding affinity of
which is at least three-fold greater than that exhibited by
the peptide RPLPPLP for the SH3 domain of Src.

25. The peptide of claim 18 the binding affinity of
which is at least three-fold greater than that exhibited by
the peptide RPLPPLP (SEQ ID NO: 9) for the SH3 domain of Src.

26. The peptide of claim 1 the binding affinity of
which is at least four-fold greater than that exhibited by
the peptide RPLPPLP (SEQ ID NO: 9) for the SH3 domain of Src.

27. The peptide of claim 18 the binding affinity of
which is at least four-fold greater than that exhibited by
the peptide RPLPPLP (SEQ ID NO: 9) for the SH3 domain of Src.

28. The peptide of claim 1 which further exhibits a
general binding affinity for the SH3 domain of Abl, Grb2,
PLC-δ, PLC-γ, Ras GAP, Nck, p85 PI-3' Kinase, and proteins
related thereto.

29. The peptide of claim 1 which exhibits a selective
binding affinity for the SH3 domain of Src and Src-related
proteins.

30. The peptide of claim 18 which further exhibits a
general binding affinity for the SH3 domain of Abl, Grb2,
PLC-δ, PLC-γ, Ras GAP, Nck, p85 PI-3' Kinase, and proteins
related thereto.

31. The peptide of claim 18 which exhibits a selective
binding affinity for the SH3 domain of Src and Src-related
proteins.

32. A peptide having the amino acid sequence
RSTPRPLPMLPTTR (SEQ ID NO: 62).

33. A peptide having the amino acid sequence
RSTPRPLPPLPTTR (SEQ ID NO: 67).

34. A peptide having the amino acid sequence
GILAPPVPPRNTR (SEQ ID NO: 63).

35. A peptide having the amino acid sequence
VLKRPLPIPPVTR (SEQ ID NO: 64).

36. A peptide having the amino acid sequence
GPHRRLPPTPATR (SEQ ID NO: 65).

37. A peptide having the amino acid sequence
ANPSPATRPLPTR (SEQ ID NO: 66).

38. A peptide having an amino acid sequence selected
from the group consisting of RSTRPLPILPRTT, STPRPLPMLPTTR,
STNRPLPMIPTTR, RSTRPLPSLPITT, STSRPLPSLPTTR, RSTRSLPPLPPTT,
RSTRQLPIPPTTT, STPRPLPLIPTTP, RSTRPLPPTPLTT, and
RSTRPQPPPPITT (SEQ ID NOS: 85-94).

39. A peptide having the amino acid sequence selected
from the group consisting of VLKRPLPIPPVTR (SEQ ID NO: 64),
YSTRPVPFITRPS (SEQ ID NO: 76), SHKSRLPPLPTRP (SEQ ID NO: 77),
GPHRRLPPTPATR (SEQ ID NO: 65), PATRPLPTRPSRT (SEQ ID NO: 81),
and SGGILAPPVPPRN (SEQ ID NO: 84).

40. A peptide having the amino acid sequence selected
from the group consisting of PPNSPLPPLPTHL (SEQ ID NO: 72),
TGRGPLPPLPNDS (SEQ ID NO: 74), YRFRALPSPPSAS, LAQRQLPPTPGRD,
ALQRRLPRTPPPA (SEQ ID NOS: 78-80), YSTRPLPSRPSRT, and
XPGRILLLPSEPR (SEQ ID NOS: 82-83).

41. A construct comprising a nucleic acid encoding a
peptide of claim 1 or its complement.

42. The construct of claim 41 which is a DNA
polynucleotide.

43. The construct of claim 41 which is a RNA
polynucleotide.

44. A construct comprising a nucleic acid encoding a
peptide of claim 18 or its complement.

45. The construct of claim 44 which is a DNA
polynucleotide.

46. The construct of claim 44 which is a RNA
polynucleotide.

47. The construct of claim 41 which is a transforming
vector.

48. The construct of claim 44 which is a transforming
vector.

49. A host cell transformed with the vector of claim
47.

50. A host cell transformed with the vector of claim
48.

51. A conjugate comprising a peptide of claim 1 and a
second molecule.

52. The conjugate of claim 51 in which said second
molecule is selected from the group consisting of an amino
acid, a peptide, a protein, a nucleic acid, a nucleoside, a
glycosidic residue, a label, a drug or a small molecule.

53. A diagnostic kit for the detection of SH3 domains
comprising an SH3 domain-binding peptide and a detectable
label conjugated to said peptide directly, indirectly or by
complexation, said peptide comprising: (i) a core sequence
motif of the formula RXLPφφP (SEQ ID NO: 71), wherein X
represents any amino acid except cysteine and φ represents a
hydrophobic amino acid residue, each letter representing the
standard one-letter designation for the corresponding amino
acid residue; and (ii) two or more additional amino acid
residues flanking said core sequence at its C-terminal end,
N-terminal end or both.

54. A drug delivery system comprising an SH3 domain-
binding peptide and a drug conjugated to said peptide
directly, indirectly or by complexation, said peptide
comprising: (i) a core sequence motif of the formula RXLPφφP
(SEQ ID NO: 71), wherein X represents any amino acid except
cysteine and φ represents a hydrophobic amino acid residue,
each letter representing the standard one-letter designation
for the corresponding amino acid residue; and (ii) two or
more additional amino acid residues flanking said core
sequence at its C-terminal end, N-terminal end or both.

55. The drug delivery system of claim 54 which may be
administered parenterally, orally, enterally, topically or by
inhalation.

56. The drug delivery system of claim 54 which may be
administered intranasally, ophthalmically or intravaginally.

57. The drug delivery system of claim 54 which is in
the form of a solid, gel, liquid or aerosol.

58. A method of modulating the activity of Src or Src-
related proteins comprising administering a composition
comprising an effective amount of a peptide of claim 1 and a
carrier.

59. The method of claim 58 which inhibits the activity
of Src or Src-related proteins.

60. The method of claim 58 which activates Src or Src-
related proteins.

61. A method of identifying a peptide having a region
that binds to an SH3 domain comprising:

(a) providing an immobilized target protein
comprising an SH3 domain;

(b) incubating said immobilized target protein
with an aliquot taken from a random peptide library;

(c) washing unbound peptide from said immobilized
target protein;

(d) recovering the peptide bound to said
immobilized target protein; and

(e) determining the primary sequence of the SH3
domain-binding peptide.

62. The method of claim 61 in which said library is a
displayed random peptide library.

63. The method of claim 62 in which said library is a
phage-displayed random peptide library.

64. The method of claim 62 in which said library is a
phagemid-displayed random peptide library.

65. The method of claim 61 in which step (c) includes
washing unbound phage from said immobilized target protein;
step (d) includes recovering the phage bound to said
immobilized target protein; and step (e) includes determining
the relevant nucleotide sequence of said binding phage
nucleic acid, from which the primary sequence corresponding
to the SH3 domain-binding peptide is deduced.
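
Step (e) of claim 65 deduces the peptide from the nucleotide sequence of the recovered phage. A minimal sketch of that deduction, assuming the standard genetic code and an in-frame insert, is given below. The function name `translate` and the example insert are hypothetical and are not taken from the application.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate an in-frame DNA string into one-letter amino acid codes."""
    dna = dna.upper().replace(" ", "")
    usable = len(dna) - len(dna) % 3   # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Hypothetical insert chosen for illustration only.
insert = "CGT CCG CTG CCG CCG CTG CCG"
print(translate(insert))   # RPLPPLP
```

In practice the reading frame is fixed by the known vector sequence flanking the insert, which this sketch does not model.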

66. A method of identifying a peptide having a region
that binds to an SH3 domain comprising:

(a) providing an immobilized target protein
comprising an SH3 domain;

(b) incubating said immobilized target protein
with an aliquot taken from a phage-displayed random peptide
library, which library includes peptides having a random
sequence of ≥8 amino acid residues;

(c) washing unbound phage from said immobilized
target protein;

(d) recovering the phage bound to said immobilized
target protein; and

(e) determining the relevant nucleotide sequence
of said binding phage nucleic acid and deducing the primary
sequence corresponding to the SH3 domain-binding peptide.

67. The method of claim 66 which further comprises
amplifying the titer of the recovered phage.

68. The method of claim 66 which further comprises
repeating the steps of incubation, washing and recovery to
provide SH3 domain-binding peptide-enriched phage.
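
The effect of repeating the incubation, washing and recovery steps (claim 68) can be illustrated with simple arithmetic. The sketch below assumes a two-population model in which binding phage and background phage are each recovered with a fixed per-round efficiency; the function name and all numerical values are hypothetical, chosen only to show the trend, and do not come from the application. Amplification between rounds (claim 67) multiplies both populations equally, so it does not change the fraction.

```python
def binder_fraction(f0, recover_binder, recover_background, rounds):
    """Fraction of the phage pool that are binders after a number of rounds.

    f0:                 starting fraction of binding phage in the library
    recover_binder:     per-round recovery efficiency of binding phage
    recover_background: per-round recovery efficiency of non-binding phage
    """
    binders = f0 * recover_binder ** rounds
    background = (1.0 - f0) * recover_background ** rounds
    return binders / (binders + background)


# Hypothetical values: one binder per million phage, 10% of binders and
# 0.01% of background phage recovered in each round.
for n in range(4):
    print(n, binder_fraction(1e-6, 0.1, 1e-4, n))
```

Under these assumed numbers each round enriches binders roughly a thousand-fold, which is why a few rounds can be enough to make binders dominate the pool.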

69. A pharmaceutical composition comprising an SH3
domain-binding peptide and a pharmaceutically acceptable
carrier, said peptide comprising: (i) a 9-mer sequence motif
of the formula RXLPφφPXψ (SEQ ID NO: 10), wherein X represents
any amino acid except cysteine, φ represents a hydrophobic
amino acid residue, and wherein ψ is a hydrophilic amino acid
residue except cysteine, each letter representing the
standard one-letter designation for the corresponding amino
acid residue; and, optionally, (ii) additional amino acid
residues flanking said 9-mer sequence at its C-terminal end,
N-terminal end or both, up to a total of 45 amino acid
residues, including said 9-mer sequence.

70. The composition of claim 69 in which at least one
additional amino acid flanks said 9-mer sequence.

71. The composition of claim 69 in which at least two
additional amino acids flank said 9-mer sequence.

72. The composition of claim 69 in which at least three
additional amino acids flank said 9-mer sequence.

73. A method of disrupting protein tyrosine kinase-
mediated signal transduction pathways comprising
administering an effective amount of a peptide of claim 1.

74. A method of regulating the processing, trafficking
or translation of RNA by administering an effective amount of
a peptide of claim 1.

Src SH3 BINDING PEPTIDES AND
METHODS OF ISOLATING AND USING SAME

1. Field of the Invention
The present invention relates to SH3 binding
peptides having a broad range of binding specificities. That
is, certain members of the SH3 binding peptides disclosed
bind with approximately the same facility with SH3 domains
derived from different SH3 domain-containing proteins. Other
members, in contrast, bind with a much greater degree of
affinity for specific SH3 domains. The SH3 binding peptides
are obtained from random peptide libraries that are also
phage-displayed. Methods are described of obtaining the
phage clones that bind to the SH3 domain targets and of
determining their relevant nucleotide sequences and
consequent primary amino acid sequence of the binding
peptides. The resulting SH3 binding proteins are useful in a
number of ways, including, but not limited to, providing a
method of modulating signal transduction pathways at the
cellular level, of modulating oncogenic protein activity or
of providing lead compounds for development of drugs with the
ability to modulate broad classes, as well as specific
classes, of proteins involved in signal transduction.

2. Background of the Invention

2.1. Src and the SH3 Domain

Among a number of proteins involved in
eukaryotic cell signaling, there is a common sequence motif
called the SH3 domain. It is 50-70 amino acids in length,
moderately conserved in primary structure, and can be present
from one to several times in a large number of proteins
involved in signal transduction and in cytoskeletal proteins.

The protein pp60c-src represents a family of at least
nine non-receptor protein tyrosine kinases (NR-PTKs).
Members of this family share an overall structural
organization comprising a series of catalytic and non-
catalytic domains. In Src, a 14-amino-acid myristylation
signal resides at the extreme amino-terminus, and is followed
by a unique region that is not highly conserved among family
members. Following this region are two highly conserved 60-
and 100-amino-acid regions, the Src homology (SH) domains 3
and 2, respectively. SH2 and SH3 domains have been shown to
play an important role in mediating protein-protein
interactions in a variety of signaling pathways. Koch, C.A.,
et al., in Science (1991) 252:668-74. The carboxy-terminal
half of Src contains the PTK catalytic domain, as well as a
negative regulatory tyrosine (Y527) near the carboxy
terminus. Phosphorylation of this residue (e.g., by Csk)
results in the inhibition of PTK activity. Cooper, J.A., et
al., in Science (1986) 231:1431-1434. Mutation of Y527->F
generates forms of Src with increased PTK and oncogenic
activity. Cartwright, C.A., et al., in Cell (1987) 49:83-91;
Kmiecik, T.E., et al., in Cell (1987) 49:65-73; and Piwnica-
Worms, H., et al., in Cell (1987) 75-82.

The fact that some mutations which result in increased
Src PTK and transforming activity map to the Src SH2 (Seidel-
Dugan, C., et al., in Mol. Cell. Biol. (1992) 12:1835-45; and
Hirai, H. and Varmus, H.E. in Mol. Cell. Biol. (1990)
10:1307-1318) and SH3 domains (Seidel-Dugan, C., et al.,
supra; Hirai, H. and Varmus, H.E., supra; Superti-Furga, G.,
et al., in EMBO J. (1993) 12:2625-34; and Potts, W.M., et
al., in Oncogene Res. (1988) 3:343-355) suggests a negative
regulatory role for these domains. That phosphotyrosine
residues within specific sequence contexts represent high
affinity ligands for SH2 domains suggests a model in which
the SH2 domain participates in Y527-mediated inhibition of
PTK activity by binding phosphorylated Y527, thereby locking
the kinase domain in an inactive configuration. Matsuda, M.,
Mayer, B.J., et al., in Science (1990) 248:1537-1539. This
model is supported by the observation that phosphopeptides
corresponding to the carboxy-terminal tail of Src bind
active, but not inactive, variants of Src. Roussel, R.R., et
al., in Proc. Natl. Acad. Sci. U S A (1991) 88:10696-700; and
Liu, X., et al., in Oncogene (1993) 8:1119-1126.

The mechanism of SH3-mediated inhibition of Src PTK
activity remains unclear. There is evidence that pY527-
mediated inhibition of Src PTK activity involves the SH3
domain as well as the SH2 domain. Okada, M., Howell, et al.,
in J. Biol. Chem. (1993) 268:18070-5; Murphy, S.M., et al.,
in Mol. Cell. Biol. (1993) 13:5290-300; and Superti-Furga,
G., et al., supra. Although these effects are thought to be
a consequence of SH3-mediated protein-protein interactions,
precisely how the Src SH3 domain exerts its negative
regulatory effect is unclear. Identification of high
affinity ligands for the Src SH3 domain could help resolve
these issues.
2.2. Protein Tyrosine Kinases and The Immune Response

Src-related tyrosine kinases are expressed in a
variety of cell types including those of the immune system
(lymphocytes, T cells, B cells, and natural killer cells) and
the central nervous system (neural cells, neurons,
oligodendrocytes, parts of the cerebellum, and the like).
Umemori, H. et al., in Brain Res. Mol. Brain Res. (1992) Dec.
16(3-4):303-310. Their presence in these cells and tissues
and their interaction with specific cell surface receptors
and immunomodulatory proteins (such as T cell antigen
receptor, CD14, CD2, CD4, CD40 or CD45) suggest that these
kinases serve an important role in the signalling pathways of
not only the central nervous system but of the immune system,
as well. See, e.g., Ren, C.L. et al., in J. Exp. Med. (1994)
179(2):673-680 (signal transduction via CD40 involves
activation of Lyn kinase); Donovan, J.A. and Koretzky, G.A.,
in J. Am. Soc. Nephrol. (1993) 4(4):976-985 (CD45, the immune
response, and regulation of Lck and Fyn kinases); and Carmo,
A.M. et al., in Eur. J. Immunol. (1993) 23(9):2196-2201
(physical association of the cytoplasmic domain of CD2 with
p56lck and p59fyn).

For instance, mice with disruptions in their Src-like
genes, Hck and Fgr, possess macrophages with impaired
phagocytic activity or exhibit a novel immunodeficiency
characterized by an increased susceptibility to infection
with Listeria monocytogenes. Lowell, C.A. et al., in Genes
Dev. (1994) 8(4):387-398. Also, it has been shown that
bacterial lipopolysaccharide (LPS) activates CD14-associated
p56lyn, p68hck, and p59c-fgr, while inducing the production
of lymphokines, such as TNF-alpha, IL-6, and IL-8.
Inhibition of the protein tyrosine kinases blocks production
of TNF-alpha and IL-1.

2.3. SH3 Binding Peptides

As mentioned above, it has long been suspected that
SH3 domains are sites of protein-protein interaction, but it
has been unclear what SH3 domains actually bind. Efforts to
identify ligands for SH3 domains have led to the
characterization of a number of SH3-binding proteins,
including 3BP1 and 2 (Ren, R., Mayer, et al., in Science
(1993) 259:1157-61), SOS (Olivier, J.P., et al., in Cell
(1993) 73:179-91; and Rozakis-Adcock, M., et al., in Nature
(1993) 363:83-5), p85 PI-3' Kinase (Xingquan, L., et al., in
Mol. Cell. Biol. (1993) 13:5225-5232), dynamin (Gout, I., et
al., in Cell (1993) 75:25-36), AFAP-110 (Flynn, D.C., et al.,
in Mol. Cell. Biol. (1993) 13:7892-7900), and CD42 (Barfod,
E.T., et al., in J. Biol. Chem. (1993) 268:26059-26062).
These proteins tend to possess short, proline-rich stretches
of amino acids, some of which have been directly implicated
in SH3 binding. A variety of consensus sequences have been
proposed, although the similarity among proline-rich regions
of different SH3-binding proteins tends to be fairly low.
Also, attempts to build consensus sequences are likely
complicated by the incorporation of data from proteins that
bind different SH3 domains.

Thus, Cicchetti, P., et al., in Science (1992) 257:803-
806, published their work relating to the isolation and
sequencing of two naturally-occurring proteins that could be
bound in vitro by the SH3 domain of the abl oncogene product.
These workers found that SH3 domains bind short, proline-rich
regions of such proteins. Subsequently, this same group
disclosed further results (Ren, R. et al., supra) in which
the SH3 binding sites of the SH3 binding proteins were
localized to "a nine- or ten-amino acid stretch rich in
proline residues." A consensus sequence incorporating the
features of the SH3 binding sites of four SH3 binding
proteins was proposed: XPXXPPPΨXP (SEQ ID NO:1), wherein X
indicates a position in the amino acid sequence which is not
conserved among the four SH3 binding proteins, P represents
proline, and Ψ indicates a hydrophobic amino acid residue,
such as P or L.

The screening of complex random peptide libraries has
been used to identify peptide epitopes for monoclonal (Scott,
J.K. and Smith, G.P. in Science (1990) 249:386-390) and
polyclonal (Kay, B.K., et al., in Gene (1993) 128:59-65)
antibodies, as well as peptide ligands for a variety of
proteins, including streptavidin (Devlin, J.J., et al., in
Science (1990) 249:404-406; and Lam, K., et al., in Nature
(1991) 354:82-84), the endoplasmic reticulum chaperone BiP
(Blond-Elguindi, S., et al., in Cell (1993) 75:717-728), and
CaM (Dedman, J.R., et al., in J. Biol. Chem. (1993)
268:23025-23030).

Recently, Chen, J.K. et al., in J. Am. Chem. Soc. (1993)
115:12591-12592, described ligands for the SH3 domain of
phosphatidylinositol 3-kinase (PI-3' Kinase) which were
isolated from a biased combinatorial library. A "biased"
library is to be distinguished from a "random" library in
that the amino acid residues at certain positions of the
synthetic peptide are fixed, i.e., not allowed to vary in a
random fashion. Indeed, as stated by these research workers,
screening of a "random" combinatorial library failed to yield
suitable ligands for a PI-3' Kinase SH3 domain probe. The
binding affinities of these unsuitable ligands were described
as weak, >100 µM, based on dissociation constants measured by
the Biosensor System (BIAcore).

More recently, Yu, et al. (Yu, H., et al., in Cell
(1994) 76:933-945) used a "biased" synthetic peptide library
of the form XXXPPXPXX (SEQ ID NO:2), wherein X represents any
amino acid other than cysteine, to identify a series of
peptides which bind the Src and PI-3' Kinase SH3 domains.
The bias was accomplished by fixing the proline residues at
the specific amino acid positions indicated for the "random"
peptide. As stated previously, without this bias, the
technique disclosed fails to identify any SH3 domain-binding
peptides.
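
To give a sense of scale for a biased design of the form
XXXPPXPXX, the sketch below counts how many distinct peptides
such a template can encode when each X is any residue other
than cysteine. The function name, the alphabet constant, and
the comparison with a fully random 9-mer are illustrative
assumptions, not part of the cited work.

```python
# Illustrative sketch only: counts sequences in a template-defined library.
# ASSUMPTION: "X" stands for any of the 19 standard amino acids other than
# cysteine, as stated for SEQ ID NO:2; every other letter is a fixed residue.
NON_CYS = "ADEFGHIKLMNPQRSTVWY"  # 19 one-letter codes, cysteine omitted


def library_size(template: str, alphabet: str = NON_CYS) -> int:
    """Number of distinct peptides matching a template such as 'XXXPPXPXX'."""
    size = 1
    for position in template:
        if position == "X":
            size *= len(alphabet)  # variable position: one factor per choice
        # fixed positions (e.g. P) contribute a factor of 1
    return size


if __name__ == "__main__":
    print(library_size("XXXPPXPXX"))  # six variable positions: 19**6
    print(library_size("X" * 9))      # fully random 9-mer, cysteine excluded
```

Fixing three prolines reduces the theoretical sequence space
by a factor of 19**3 relative to an unconstrained 9-mer, which
is one way to see why a biased library samples its space more
densely than a random library of the same physical size.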

A consensus sequence, based on 13 binding peptides, was
suggested: RXLPPRPXX (SEQ ID NO:3), where X tends to be a
basic residue (like R, K or H). The binding affinities of
several SH3 binding peptides were disclosed as ranging from
8.7 to 30 µM. A "composite" peptide, RKLPPRPKR (SEQ ID
NO:4), was reported to have a binding affinity of 7.6 µM.
This value compares favorably to the binding affinity of the
peptide, VPPPVPPRRR (SEQ ID NO:5), to the N-terminal SH3
domain of Grb2. See, Kraulis, P.J. J. Appl. Crystallogr.
(1991) 24:946. Recognizing the limitations of their
technique, Chen and co-workers, supra, stated that their
results "illustrate the utility of biased combinatorial
libraries for ligand discovery in systems where there is some
general knowledge of the ligand-binding characteristics of
the receptor" (emphasis added).

Yu and co-workers, supra, further described an SH3
binding site consensus sequence, XpΦPpXP (SEQ ID NO:6),
wherein X represents non-conserved residues, Φ represents
hydrophobic residues, P is proline, and p represents residues
that tend to be proline. A consensus motif of RXLPPRPXX (SEQ
ID NO:7), where X represents any amino acid other than
cysteine, was proposed for ligands of PI-3' Kinase SH3
domain. A consensus motif of RXLPPLPRΦ (SEQ ID NO:8), where
Φ represents hydrophobic residues, was proposed for ligands
of Src SH3 domain. Still, the dissociation constants
reported for the 9-mer peptides ranged only from about 8-70
µM and selectivity between one type of SH3 domain and another
was relatively poor, the KDs differing by only about a factor
of four.

Hence, there remains a need to develop techniques for
the identification of Src SH3 binding peptides which do not
rely on such "biased" combinatorial peptide libraries that
are limited to a partially predetermined set of amino acid
sequences. Indeed, the isolation of SH3 binding peptides
from a "random" peptide library has not been achieved
successfully before now. Furthermore, particular peptides
having much greater binding affinities, whether general or
more selective binding for specific SH3 domains, remain to be
identified. Binding peptides specific for particular SH3
domains are useful, for example, in modulating the activity
of a particular SH3 domain-containing protein, while leaving
others bearing an SH3 domain unaffected. Still, the more
promiscuous general binding peptides are useful for the
modulation of a broad spectrum of SH3 domain-containing
proteins.

The present invention relates to such SH3 binding
peptides, methods for their identification, and compositions
comprising same. In particular, peptides comprising
particular sequences of amino acid residues are disclosed
which were isolated from random peptide libraries. In the
present invention, clones were isolated from a phage-
displayed random peptide library which exhibited strong
binding affinities for SH3 domain-containing protein targets.
Some of these protein targets include Abl, Src, Grb2, PLC-δ,
PLC-γ, Ras GAP, Nck, and p85 PI-3' Kinase. From the
nucleotide sequence of the binding phage, the amino acid
sequence of the peptide inserts has been deduced. Synthetic
peptides having the desired amino acid sequences are shown to
bind the SH3 domain of the target proteins. In particular,
synthetic peptides combining a core consensus sequence and
additional amino acid residues flanking the core sequence are
especially effective at binding to particular target protein
SH3 domains. The SH3 binding peptides disclosed herein can
be utilized in a number of ways, including the potential
modulation of oncogenic protein activity in vivo. These
peptides also serve as useful leads in the production of
peptidomimetic drugs that modulate a large class of proteins
involved in signal transduction pathways and oncogenesis.

3. Summary of the Invention

Accordingly, three phage-displayed random peptide
libraries were screened for isolates that bind to bacterial
fusion proteins consisting of the Src homology region 3 (SH3)
and glutathione S-transferase (GST). DNA sequencing of the
isolates showed that they contained sequences that resemble
the consensus motif, RPLPPLP (SEQ ID NO:9), within their 8,
22, or 36 amino acid long random regions. When peptides were
synthesized corresponding to the pIII inserts of the SH3-
binding phage, they bound to the GST fusions of the SH3
domains of Src and the Src-related proteins, such as Yes, but
not of Grb2, Crk, Abl, or PLCγ1. The synthesized peptides
bind quite well to the Src SH3 domain and act as potent
competitors of natural Src SH3 interactions in cell lysates.
For instance, these peptides can compete with radiolabelled
proteins from cell lysates in binding to immobilized Src-GST,
with an apparent IC50 of 1-10 µM. When a peptide bearing the
consensus sequence RPLPPLP (SEQ ID NO:9) was injected into
Xenopus laevis oocytes, it accelerated the rate of
progesterone-induced maturation. These results demonstrate
the utility of phage-displayed random peptide libraries in
identifying SH3-binding peptide sequences and that such
identified peptides exhibit both in vivo and in vitro
biological activity.

Thus, it is an object of the present invention to
provide peptides having at least nine and up to forty-five
amino acid residues, including an amino acid sequence of the
formula, R-2-L-P-5-6-P-8-9 (SEQ ID NO:10), positioned
anywhere along the peptide, in which each number represents
an amino acid residue, such that 2 represents any amino acid
residue except cysteine, 5 and 6 each represents a
hydrophobic amino acid residue, 8 represents any amino acid
residue except cysteine, and 9 represents a hydrophilic amino
acid residue except cysteine, each letter being the standard
one-letter symbol for the corresponding amino acid, said
peptide exhibiting a binding affinity for the SH3 domain of
Src, provided that said peptide is not R-P-L-P-P-L-P-T-S (SEQ
ID NO:11). In a particular embodiment of the present
invention, the peptides also exhibit a binding affinity for
the SH3 domain of Src-related proteins, including Yes, Fyn,
Lyn, Lck, Hck and Fgr.

The present invention also contemplates SH3 domain-
binding peptides that further comprise a C-terminal-flanking
amino acid sequence of the formula 10, 10-11, 10-11-12, 10-
11-12-13 (SEQ ID NO:12) or 10-11-12-13-14 (SEQ ID NO:13), in
which each number represents any amino acid residue except
cysteine, such that 10 is bound to 9 by a peptide bond.
Furthermore, peptides are also provided which further
comprise an N-terminal-flanking amino acid sequence of the
formula 1', 2'-1', 3'-2'-1' or 4'-3'-2'-1' (SEQ ID NO:14) in
which each number represents any amino acid residue except
cysteine, such that 1' is bound to R by a peptide bond.

Thus, in a particular embodiment, a peptide is disclosed
having at least thirteen and up to forty-five amino acid
residues, including an amino acid sequence of the formula,
3'-2'-1'-R-2-L-P-5-6-P-8-9-10 (SEQ ID NO:15), positioned
anywhere along the peptide, in which each number represents
an amino acid residue, such that 3', 2', 1', 2, 8, and 10
each represents any amino acid residue except cysteine, 5 and
6 each represents a hydrophobic amino acid residue, and 9
represents a hydrophilic amino acid residue except cysteine,
each letter being the standard one-letter symbol for the
corresponding amino acid, said peptide exhibiting a binding
affinity for the SH3 domain of Src.

The present invention also seeks to provide new
consensus sequences or motifs that reflect variations in SH3
domain binding selectivities or specificities. The present
invention also contemplates conjugates of the SH3 binding
peptides and a second molecule or chemical moiety. This
second molecule may be any desired substance whose delivery
to the region of the SH3 domain of a particular protein (or
cell containing the protein) is sought. Possible target
cells include, but are not limited to, neural cells, immune
cells (e.g., T cells, B cells, natural killer cells, and the
like), osteoclasts, platelets, epidermal cells, and the like,
which cells express Src, Src-related proteins, and
potentially, other SH3 domain-containing proteins. In this
manner, the modulation of the biological activity of proteins
bearing an SH3 domain can be accomplished.

Other methods and compositions consistent with the
objectives of the present invention are likewise disclosed.
In particular, a method is disclosed of modulating the
activity of Src or Src-related proteins comprising
administering a composition comprising an effective amount of
a peptide of the present invention and a carrier, preferably
a pharmaceutically acceptable carrier. In a specific
embodiment, the contemplated method results in the inhibition
of the activity of Src or Src-related proteins.
Alternatively, the method is effective to activate Src or
Src-related proteins.

In yet another embodiment, a method is disclosed of
identifying a peptide having a region that binds to an SH3
domain comprising: (a) providing an immobilized target
protein comprising an SH3 domain; (b) incubating the
immobilized target protein with an aliquot taken from a
random peptide library; (c) washing unbound library peptides
from the immobilized target protein; (d) recovering the
peptide bound to the immobilized target protein; and (e)
determining the primary sequence of the SH3 domain-binding
peptide.
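
The flow of steps (a) through (e) can be pictured with a toy
in-silico walk-through. This is only a sketch of the data
flow, not a laboratory protocol: physical binding is replaced
by a caller-supplied predicate, and the function name, the
example aliquot, and the "RPLP" test used as a stand-in for
binding are all hypothetical.

```python
# Toy in-silico walk-through of steps (a)-(e); not a laboratory protocol.
# ASSUMPTION: physical binding is replaced by a caller-supplied predicate
# `binds`, and "sequencing" is trivial because peptides are plain strings.
from typing import Callable, Iterable, List


def pan_library(library: Iterable[str], binds: Callable[[str], bool]) -> List[str]:
    target_bound = []                     # (a) immobilized target, nothing bound yet
    for peptide in library:               # (b) incubate target with a library aliquot
        if binds(peptide):
            target_bound.append(peptide)  # binders are retained on the target
        # (c) non-binders are "washed away" by simply not being kept
    recovered = list(target_bound)        # (d) recover what stayed bound
    return sorted(set(recovered))         # (e) report the distinct sequences


if __name__ == "__main__":
    aliquot = ["VLKRPLPIPPVTR", "GGGSGGGSGGGSG", "RSTPRPLPPLPTTR"]
    # Hypothetical stand-in for SH3 binding: presence of an "RPLP" stretch.
    print(pan_library(aliquot, lambda p: "RPLP" in p))
```

In an actual screen the recovered phage would be amplified and
the cycle repeated; the sketch shows a single pass only.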

Moreover, a method is disclosed of imaging cells,
tissues, and organs in which Src or Src-related proteins are
expressed, which comprises administering an effective amount
of a composition comprising an SH3 domain-binding peptide
conjugated to a detectable label or an imaging agent.

Other objectives of the present invention will become
apparent to one of ordinary skill in the art after
consideration of the above disclosure and the following
detailed description of the preferred embodiments.

4. Brief Description of the Figures

FIG. 1 illustrates a scheme for the generation of a
random 36 amino acid peptide library (TSAR-9; e.g., SEQ ID
NO:16). Oligonucleotides were synthesized (SEQ ID NOS:17-
18), converted into double-stranded DNA, cleaved with
restriction enzymes (SEQ ID NOS:19-20), and cloned into the
M13 vector, m663. The random peptide region encoded by the
oligonucleotides is shown in the box (SEQ ID NO:16) and is
situated at the N-terminus of mature protein III (SEQ ID
NO:21).

FIG. 2 illustrates a scheme for the generation of a
random 22 amino acid peptide library (TSAR-12; e.g., SEQ ID
NO:23). Oligonucleotides were synthesized (SEQ ID NOS:24-
25), converted into double-stranded DNA, cleaved with
restriction enzymes (SEQ ID NOS:26-27), and cloned into the
M13 vector, m663. The random peptide region encoded by the
oligonucleotides is shown in the box (SEQ ID NO:23) and is
situated at the N-terminus of mature protein III (SEQ ID
NO:28).

FIG. 3 illustrates a scheme for the generation of a
random 8 amino acid peptide library (R8C; SEQ ID NO:30).
Oligonucleotides were synthesized (SEQ ID NOS:31-32),
converted into double-stranded DNA, cleaved with restriction
enzymes (SEQ ID NOS:33-34), and cloned into the M13 vector,
m663. The random peptide region (SEQ ID NO:30) is flanked by
cysteine residues and is situated at the N-terminus of mature
protein III (SEQ ID NO:35).

FIG. 4 illustrates the possible origin of one class of
double-insert R8C recombinants (e.g., encoding SEQ ID NO:36).
Double-stranded oligonucleotides (e.g., SEQ ID NO:37) may
have ligated in a head-to-head fashion at the Xba I site
prior to cloning in the Xho I-Xba I cleaved M13 vector.

FIG. 5 shows a list of random peptide recombinants (SEQ
ID NOS:38-61 and 106) isolated by the method of the present
invention and the displayed peptide sequence. The amino acid
sequences are aligned to highlight the core sequences. The
flanking sequences are shown to the N-terminal and C-terminal
ends of the core sequence.

FIG. 6 graphically illustrates the relative binding
affinities of selected phage clones for various SH3 domains.
The results indicate that certain amino acid sequences
provide generic SH3 domain binding, while others can provide
greater selectivity for the SH3 domain of Src. Still other
clones exhibit Src SH3 domain preferential binding.

FIG. 7 shows the binding of synthetic peptides (SEQ ID
NOS:9 and 62-70) representing Src SH3-selected phage inserts
to Src SH3-GST fusion target (shaded columns) over background
GST binding (unshaded columns) relative to the core peptide
RPLPPLP (SEQ ID NO:9) and proline-rich peptide segments
derived from naturally occurring proteins. Bound
biotinylated peptide was detected with streptavidin-alkaline
phosphatase ELISA. Each point was performed in triplicate;
average absorbance at 405 nm is presented. Error bars
represent SD.

FIG. 8 illustrates the relative specificity of selected
peptides (SEQ ID NOS:9 and 62-70) for SH3 domains derived
from different proteins. In particular, the binding
affinities of the peptides for the SH3 domains of the
following protein fusion targets were tested: Src SH3-GST,
Yes SH3-GST, Grb2-GST, Crk SH3-GST, Abl SH3-GST, PLCγ1
SH2SH3-GST. Bound biotinylated peptide was detected with
streptavidin-alkaline phosphatase. Each point was performed
in triplicate; values are average signal (absorbance at 405
nm) above GST background, with error bars representing
standard deviation. Hatched bars indicate saturation of the
ELISA signal.

FIG. 9 presents the results of competition experiments
in which selected peptides were found to inhibit the binding
of proteins from cell lysates to immobilized Src SH3-GST or
Abl SH3-GST protein fusion targets.

FIG. 10 presents a graph illustrating the increased rate
of progesterone-induced maturation of oocytes injected with
an SH3 domain-binding peptide, VLKRPLPIPPVTR (SEQ ID NO:64),
of the present invention. Briefly, Stage VI oocytes were
prepared and injected as previously described (see, Kay,
B.K., Methods in Cell Biol. (1991) 36:663-669). Oocytes
were injected with 40 nL of 100 mM test peptide or water.
After injection, the oocytes were placed in 2 µg/mL
progesterone (Sigma, St. Louis, MO) and scored hourly for
germinal vesicle breakdown (GVBD).

FIG. 11 shows the results of fluorescence experiments in
which certain peptides, Panel A = VLKRPLPIPPVTR (SEQ ID
NO:64), Panel B = GILAPPVPPRNTR (SEQ ID NO:63), Panel C =
RSTPRPLPPLPTTR (SEQ ID NO:67), of the invention were shown to
localize within cellular compartments thought to contain Src
or Src-related proteins.

FIG. 12 illustrates a scheme for the generation of a
biased peptide library. Oligonucleotides were synthesized,
converted into double-stranded DNA, cleaved with restriction
enzymes XhoI and XbaI, and cloned into the mBAX vector,
described further below in the Examples section. The biased
peptide region is situated at the N-terminus of mature pIII
protein.

FIG. 13 illustrates the peptide sequence encoded in the
mBAX vector situated at the N-terminus of mature pIII
protein.

5. Detailed Description of the Preferred Embodiments
5.1. General Considerations

The present invention relates to peptides that
exhibit a binding affinity for an SH3 domain, which domain
has been found to be present in a number of physiologically
significant proteins. In particular, peptides are disclosed
which exhibit general binding characteristics to the SH3
domains found in a group of proteins, including but not
limited to Abl, Src, Grb2, PLC-δ, PLC-γ, Ras GAP, Nck, and
p85 PI-3' Kinase. Preferred peptides exhibit selective, if
not specific, binding affinity for the SH3 domain of Src. As
described herein, the peptides of the present invention
include a core sequence, preferably a consensus sequence, and
additional amino acid residues that flank the core sequence.
These peptides, including the methods for their
identification, are described in greater detail, below.

Thus, in a specific embodiment of the invention,
peptides are provided which have at least nine and up to
about forty-five amino acid residues, including an amino acid
sequence resembling the formula,
R-2-L-P-5-6-P-8-9 (SEQ ID NO:10),
positioned anywhere along the peptide. In the above-
mentioned formula, each number represents an amino acid
residue, such that 2 represents any amino acid residue except
cysteine, 5 and 6 each represents a hydrophobic amino acid
residue, 8 represents any amino acid residue except cysteine,
and 9 represents a hydrophilic amino acid residue except
cysteine. Each letter used in the formulas herein represents
the standard one-letter symbol for the corresponding amino
acid. When the peptide is a 9-mer, the peptide
R-P-L-P-P-L-P-T-S (SEQ ID NO:11) is excluded. The peptides
of particular interest are those that exhibit a binding
affinity for the SH3 domain of Src and Src-related proteins,
including Yes, Fyn, Lyn, Lck, Hck and Fgr. Preferably, the
peptides of the invention exhibit a binding affinity for the
SH3 domain of Src, which is at least three-fold, more
preferably at least four-fold, most preferably at least about
five-fold greater than that exhibited by the peptide RPLPPLP
(SEQ ID NO:9). In still other embodiments, the peptides
exhibit a binding affinity for the SH3 domain of Src which is
at least ten-fold greater than that exhibited by the peptide
RPLPPLP (SEQ ID NO:9).
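
As a rough illustration of how the formula R-2-L-P-5-6-P-8-9
can be read as a sequence pattern, the sketch below encodes it
as a regular expression and scans a peptide for it. The text
does not enumerate which residues count as hydrophobic or
hydrophilic, so the two residue classes used here are an
assumed textbook grouping, and treating "about forty-five" as
a strict upper limit is likewise an assumption. A pattern
match says nothing about actual binding affinity.

```python
import re

# ASSUMPTION: the residue classes below are a common textbook grouping
# chosen for illustration; the text does not enumerate them.
ANY_NON_CYS = "ADEFGHIKLMNPQRSTVWY"   # any standard residue except cysteine
HYDROPHOBIC = "AFILMPVW"              # assumed hydrophobic class
HYDROPHILIC_NON_CYS = "DEHKNQRSTY"    # assumed hydrophilic class, no cysteine

# R-2-L-P-5-6-P-8-9, allowed to occur anywhere along the peptide.
CORE = re.compile(
    f"R[{ANY_NON_CYS}]LP[{HYDROPHOBIC}][{HYDROPHOBIC}]P"
    f"[{ANY_NON_CYS}][{HYDROPHILIC_NON_CYS}]"
)
EXCLUDED_9MER = "RPLPPLPTS"  # SEQ ID NO:11, excluded when the peptide is a 9-mer


def matches_formula(peptide: str) -> bool:
    """True if the peptide is 9-45 residues long and contains the core pattern."""
    if not 9 <= len(peptide) <= 45:
        return False
    if peptide == EXCLUDED_9MER:
        return False
    return CORE.search(peptide) is not None


if __name__ == "__main__":
    for seq in ("RSTPRPLPMLPTTR", "VLKRPLPIPPVTR", "RPLPPLPTS", "RPLPPLP"):
        print(seq, matches_formula(seq))
```

Under these assumed classes, the first two example peptides
contain the pattern, while the excluded 9-mer and the 7-mer
core on its own do not satisfy the length and exclusion rules.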

In specific embodiments, peptides are disclosed in which
the various amino acid residues at the indicated positions
may independently have the following preferred identities: 2
is a P, R, A, L, Q, E or S, more preferably P or R; 5
represents a P, M, I or L, more preferably P or M; 6 is a P,
I or V, more preferably P or L; 8 is a R, P, I, N, E,
V, A, G or L, more preferably T or R; and 9 is a T, R,
H or D, more preferably T or R. Despite the preference for
hydrophobic amino acid residues at 5 and 6, in some cases it
may be desirable to have hydrophilic amino acid residues at
these positions. Specifically, amino acid residue 5 may be a
R or S, and amino acid residue 6 may be a T or R.
Likewise, while a hydrophilic amino acid residue is preferred
at position 9, in some instances a hydrophobic residue, such
as a P or A, may be desirable.

The present invention also contemplates SH3 domain-
binding peptides with a minimum length of 10, 11, 12, 13, 14,
15 or more amino acids. Such peptides contain additional
amino acid residues flanking the core sequence of
R-2-L-P-5-6-P (SEQ ID NO:71) either at the C-terminal end,
the N-terminal end or both. Thus, for example, such peptides
include those that further comprise a C-terminal-flanking
amino acid sequence of the formula 10, 10-11, 10-11-12, 10-
11-12-13 (SEQ ID NO:12) or 10-11-12-13-14 (SEQ ID NO:13), in
which each number represents any amino acid residue except
cysteine, such that the amino acid residue 10 is bound to the
amino acid residue 9 by a peptide bond. In that case,
specific embodiments include an amino acid residue 10 which
is T, R, L, S, D, P, A or M, preferably T or R, an amino acid
residue 11 which is R, P, A, Q, S or T, preferably R or P, an
amino acid residue 12 which is P, S, R or T, preferably P or
S, an amino acid residue 13 which is P, S, R, F, H or T,
preferably P or S, and an amino acid residue 14 which is S,
R, G or T, preferably S or R.

Furthermore, peptides are also provided which further
comprise an N-terminal-flanking amino acid sequence of the
formula 1', 2'-1', 3'-2'-1' or 4'-3'-2'-1' (SEQ ID NO:14) in
which each number represents any amino acid residue except
cysteine, such that 1' is bound to R by a peptide bond. In
such a case, specific embodiments are provided in which the
amino acid residue 1' is T, P, S, N, F, W, K, H, Q or G,
preferably T or P, wherein the amino acid residue 2' is S, T,
P, R, Q, L, A or H, preferably S or T, wherein the amino
acid residue 3' is R, S, P, G, A, V, Y or L, preferably S or
T, and wherein the amino acid residue 4' is R, S, V, T, G, L
or F, preferably R or S.

In a particular embodiment, a peptide is disclosed
having at least thirteen and up to forty-five amino acid
residues, including an amino acid sequence of the formula,
3'-2'-1'-R-2-L-P-5-6-P-8-9-10 (SEQ ID NO:15), positioned
anywhere along the peptide, in which each number represents
an amino acid residue, such that 3', 2', 1', 2, 8, and 10
each represents any amino acid residue except cysteine, 5 and
6 each represents a hydrophobic amino acid residue, and 9
represents a hydrophilic amino acid residue except cysteine,
each letter being the standard one-letter symbol for the
corresponding amino acid, said peptide exhibiting a binding
affinity for the SH3 domain of Src. Preferred 13-mers
include, but are not limited to, those having an amino acid
residue 5 which is a P or M, an amino acid residue 1' which
is T, P, S or N, an amino acid residue 2' which is S or T, an
amino acid residue 3' which is R or S, and an amino acid
residue 10 which is T or R. In all the SH3 domain-binding
peptides described herein, the prohibition against the use of
the hydrophilic amino acid residue cysteine (C) does not
extend beyond the 7-mer "core" sequence and the additional
amino acid residues flanking the core up to a total (core +
flanking) of about 20 amino acids. That is, the occasional
use of a cysteine is not absolutely prohibited. What should
be kept in mind is that the potential for the formation of
intramolecular disulfide bonds, to form a cyclic structure,
be minimized as much as possible. Applicants have found that
cyclized structures appear to be disfavored, at least with
potential binding peptides of less than about 15 amino acid
residues in length. The concern for the formation of
cyclized structures comprising the core sequence diminishes
with increasing size of the peptide. Presumably, a large
enough structure, though cyclic, may allow the critical core
sequence to adopt a more or less linear conformation.

In particular, specific peptides are disclosed which
exhibit binding affinities to SH3 domains. These include the
peptides, RSTPRPLPMLPTTR (SEQ ID NO. 62), RSTPRPLPPLPTTR (SEQ
ID NO. 67), GILAPPVPPRNTR (SEQ ID NO. 63), VLKRPLPIPPVTR (SEQ
ID NO. 64), GPHRRLPPTPATR (SEQ ID NO. 65), and ANPSPATRPLPTR
(SEQ ID NO. 66).

Phage clones are also disclosed, along with the amino
acid sequences that are responsible for SH3 domain binding.
These phage clones are identified in Figure 5.

In other embodiments of the present invention, SH3
domain-binding peptides are contemplated which have a total
of 11, 13, 14, 18, 20, 22, 23, 25, 30, 36, 38 or 45 amino
acid residues.

The peptides of the present invention, having been
disclosed herein, may be prepared by any number of
practicable methods, including but not limited to solution-
phase synthesis, solid-phase synthesis, protein expression by
a transformed host, cleavage from a naturally-derived,
synthetic or semi-synthetic polypeptide, or a combination of
these techniques.

The SH3 binding peptides exhibit a wide range of
biological activity which includes the enhancement (or
inhibition, depending on the particular peptide or the nature
of the peptide's target molecule, in this case a protein
bearing an SH3 domain) of the natural function or biological
activity of the peptide's target molecule. For example, the
interaction of the binding peptide of the present invention
could result in the modulation of the oncogenic activity of
the target molecule bearing the SH3 domain. If the target
molecule has, in turn, a natural binding partner or ligand,
the peptides of the present invention may also exhibit
antagonistic or agonistic activity in relation to the
biological activity of the natural binding partner.

Thus, it is an object of the present invention to
provide a method of activating Src or Src-related protein
tyrosine kinases by administering an effective amount of the
SH3 domain-binding peptides generally described herein. The
intensity of immune response can thus be stimulated, for
example, by the increased production of certain lymphokines,
such as TNF-alpha and interleukin-1. As is generally known
to those of ordinary skill in the art, a more intense immune
response may be in order in certain conditions, such as in
combating a particularly tenacious infection, viral or
otherwise, or a malignancy.

Furthermore, in a specific embodiment of the present
invention, a conjugate compound is contemplated which
comprises the peptide of the present invention and a second
chemical moiety. The second chemical moiety can be selected
from a wide variety of chemical compounds including the
peptide itself. Typically, however, the second chemical
moiety is selected to be other than the peptide of the
present invention, including but not limited to an amino
acid, a peptide other than an SH3 binding peptide of the
present invention, a polypeptide or protein (i.e., the
conjugate is a fusion protein), a nucleic acid, a nucleoside,
a glycosidic residue (i.e., any sugar or carbohydrate), a
label or image-enhancing agent (including metals, isotopes,
radioisotopes, chromophores, fluorophores (such as FITC,
TRITC, and the like), and enzyme substrates), a drug
(including synthetic, semisynthetic, and naturally-occurring
compounds), small molecules (e.g., biotin, hormones, factors)
and the like.

The peptide of the present invention can be conjugated
to the second chemical moiety either directly (e.g., through
appropriate functional groups, such as an amine or carboxylic
acid group to form, for example, an amine, imine, amide,
ester, acyl or other carbon-carbon bond) or indirectly
through the intermediacy of a linker group (e.g., an
aliphatic or aromatic polyhydroxy, polyamine, polycarboxylic
acid, polyolefin or appropriate combinations thereof).
Moreover, the term conjugate, as used herein, is also meant
to encompass non-covalent interactions, including but not
limited to ionic, affinity or other complexation
interactions. Preferably, such other non-covalent
interactions provide definable, most preferably, isolatable
chemical conjugate species.

As described further herein, the peptides of the present
invention have been shown to localize within certain cellular
compartments which contain Src or Src-related proteins.
Consequently, the above-described conjugate can be utilized
as a delivery system for introduction of a drug to cells,
tissues or organs that include SH3 domain-containing
proteins.

It should also be pointed out that the present invention
seeks to provide a recombinant construct comprising a nucleic
acid or its complement that includes codons or nucleotide
sequences encoding a peptide having a region that binds to an
SH3 domain, preferably the Src SH3 domain. The recombinant
nucleic acid may be a DNA or RNA polynucleotide.

In a specific embodiment, the present invention
contemplates a recombinant construct which is a transforming
vector. Such vectors include those well known to those of
ordinary skill in the art, which effect the transfer or
expression of the nucleotide sequence after introduction to a
host, such as recombinant plasmid, phage or yeast artificial
chromosome. These vectors may be closed circular loops or
they may be linearized. The vectors contemplated include
those that exist extrachromosomally after host transformation
or transfection, as well as those that integrate within or
even displace portions of the host chromosome. The vectors
may be introduced to the cell with the help of transfection
aids or techniques well-known in the art. For example, these
aids or techniques may take the form of electroporation, use
of calcium chloride, calcium phosphate, DEAE dextran,
liposomes or polar lipid reagents known as LIPOFECTIN or
LIPOFECTAMINE. In addition, the present invention
contemplates the direct introduction of the desired nucleic
acid to the host cell, for instance, by injection.

Transformed host cells are also obtained by the methods
of the present invention which are capable of reproducing the
polynucleotide sequences of interest and/or expressing the
corresponding peptide products. A variety of hosts are
contemplated, including prokaryotic and eukaryotic hosts. In
particular, bacterial, viral, yeast, animal, and plant cells
are potentially transformable hosts. Thus, a method is
disclosed to obtain a transformed host cell that can produce,
preferably secrete, a peptide having a region that binds to
an SH3 domain comprising (a) providing an expression vector,
preferably a secretory expression vector, comprising a
nucleotide sequence encoding at least one copy of a peptide
having a region that binds to an SH3 domain; and (b)
introducing the vector to a competent host cell.

The peptides, thus produced, may then be introduced to
cells, tissues, organs, or administered to the subject for
the purpose of modulating the biochemical activity of the SH3
domain-containing proteins present therein. Accordingly, in
specific embodiments of the present invention, compositions
are provided which comprise an SH3 domain-binding peptide,
including a core sequence and flanking sequences, and a
suitable carrier.

The compositions contemplated by the present invention
may also include other components, from those that facilitate
the introduction or administration of the compositions to
those that have their own innate activity, such as a
prophylactic, a diagnostic or a therapeutic action. Such
innate activity may be distinct from that of the peptides of
the present invention or be complementary thereto. In any
event, the compositions of the present invention include
those that are suitable for administration into mammals,
including humans. Preferably, the compositions (including
necessarily the carrier) of the present invention are
sterile, though others may need only be cosmetically,
agriculturally or pharmaceutically acceptable. Still other
compositions may be adapted for veterinary use.

The compositions, including the drug delivery systems
described herein, are contemplated to be administered in a
variety of ways, such as parenterally, orally, enterally,
topically or by inhalation. The compositions may also be
administered intranasally, ophthalmically or intravaginally.
Furthermore, the compositions of the invention can take
several forms, such as solids, gels, liquids, aerosols or
patches.

In another embodiment of the present invention a method
is provided of identifying a peptide having a region that
binds to an SH3 domain comprising: (a) providing an
immobilized target protein comprising an SH3 domain; (b)
incubating the immobilized target protein with an aliquot
taken from a phage-displayed random peptide library, which
library includes peptides having a random sequence of
amino acid residues; (c) washing unbound phage from the
immobilized target protein; (d) recovering the phage bound to
the immobilized target protein; and (e) determining the
relevant nucleotide sequence of said binding phage nucleic
acid and deducing the primary sequence corresponding to the
SH3 domain-binding peptide. Preferably, the method further
comprises amplifying the titer of the recovered phage and
repeating the steps of incubation, washing and recovery to
provide SH3 domain-binding peptide-enriched phage.
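
Step (e) of the method above, deducing the displayed peptide
from the nucleotide sequence of the phage insert, can be
illustrated with a minimal sketch. It assumes the standard
genetic code and an insert that is already in frame; the
function name deduce_peptide and the example DNA string are
hypothetical and do not come from any sequenced clone. Stop
codons are simply rendered as "*".

```python
# Minimal sketch: translate an in-frame phage insert into its peptide.
# Assumes the standard genetic code; names and example DNA are hypothetical.

BASES = "TCAG"
# Amino acids in the conventional TCAG codon ordering (TTT, TTC, TTA, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def deduce_peptide(insert_dna: str) -> str:
    """Return the one-letter peptide encoded by an in-frame DNA insert."""
    dna = insert_dna.upper().replace("U", "T")
    if len(dna) % 3 != 0:
        raise ValueError("insert length is not a multiple of three")
    try:
        return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    except KeyError as exc:
        raise ValueError(f"unrecognized codon: {exc.args[0]}") from None


if __name__ == "__main__":
    # A made-up insert that happens to encode the core consensus RPLPPLP.
    print(deduce_peptide("CGTCCGCTGCCGCCGCTGCCG"))  # RPLPPLP
```

In practice the reading frame would be fixed by the known
vector sequence flanking the insert, which this sketch does
not model.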

Any other mode by which the peptide library, random or
otherwise, can be "displayed" can be utilized in the present
invention, however. Moreover, the present applicants believe
that longer random peptide sequences (e.g., >6 amino acid
residues, preferably >10, and most preferably, >12) provide
not only much greater diversity but also a richer degree of
secondary structure conducive to binding activity. If the
random region of the peptide is less than or equal to an 8-
mer, it should preferably not be cyclized.

5.2. Preparation of Random Peptide Libraries 

The preparation and characterization of the
preferred phage-displayed random peptide libraries have been
described elsewhere. See, for example, Kay, B.K. et al. in
Gene (1993) 128:59-65, for a description of the preparation
of the phage-displayed random peptide library known as TSAR-
9, more below. In particular, by cloning degenerate
oligonucleotides of fixed length into bacteriophage vectors,
recombinant libraries of random peptides can be generated
which are expressed at the amino-terminus of the pIII protein
on the surface of M13 viral particles. (There are 3-5 copies
of the pIII-fusion on the surface of each particle.) Phage
display offers several conveniences: first, the expressed
peptides are on the surface of the viral particles and
accessible for interactions; second, the recombinant viral
particles are stable (i.e., can be frozen, exposed to pH
extremes); third, the viruses can be amplified; and fourth,
each viral particle contains the DNA encoding the recombinant
genome. Consequently, these libraries can be screened by
isolating viral particles that bind to targets. The isolates
can be grown up overnight, and the displayed peptide sequence
responsible for binding can be deduced by DNA sequencing.
These libraries have approximately >10' different 
recombinants, and nucleotide sequencing of the inserts
suggests that the expressed peptides are indeed random in
amino acid sequence. These libraries are referred to herein
as TSAR libraries, where TSAR stands for Totally Synthetic
Affinity Reagents. The preparation of the TSAR libraries is
described further below.

5.3. SH3 Binding Clones And Their Characteristics

Accordingly, peptides have been isolated from an
unconstrained random peptide library which exhibit a binding
affinity for SH3 domains. Furthermore, the binding
affinities exhibited by the disclosed peptides differ in
their selectivities, with certain peptides showing comparable
binding affinities for SH3 domains derived from different
proteins, while others manifest greater affinities for
specific SH3 domains.

The amino acid sequences of various peptides isolated by
the present method are listed in Figure 5. As can be seen
from this list, certain groups of SH3 domain binding peptides
are isolated from three separate random peptide libraries,
each based on a different type of random peptide insert, all
displayed at the amino-terminus of the pIII protein on the
surface of M13 viral particles. Ten clones were isolated
from the R8C library, seven from the TSAR-12 library, and
seven from the TSAR-9 library. The sequences are presented
to highlight the particular amino acid residues believed to
bind directly to the SH3 domain, as well as to point out the
remaining amino acid residues of the random insert and the
viral flanking sequences and complementary site amino acid
residues common to each group of clones. The frequency with
which each particular clone is found in each library is also
indicated in Figure 5. Thus, clones T12.SRC3.1 and
T12.SRC3.2 are by far the most abundant clones found among
the three libraries.

Interestingly, all the binding peptides are found to
have the proline-rich amino acid residue motif, which is
apparently responsible for binding, the motif being located
predominantly at the C-terminal end of the insert, although
each clone also contains an insert at the N-terminal end.
The significance of this observation is not presently
understood, although this finding may indicate the possible
importance of the C-terminal viral flanking sequences in SH3
domain binding.

Indeed, a synthetic peptide bearing only the core
consensus sequence RPLPPLP (SEQ ID NO: 9) was less effective
in binding to target SH3 domains than synthetic peptides that
also included additional amino acid residues flanking the
core sequences. Thus, 13-mers and 14-mers having the
sequences RSTPRPLPMLPTTR (SEQ ID NO: 62), RSTPRPLPPLPTTR (SEQ
ID NO: 67), GILAPPVPPRNTR (SEQ ID NO: 63), GPHRRLPPTPATR (SEQ
ID NO: 65), and VLKRPLPIPPVTR (SEQ ID NO: 64) have been
prepared and shown to bind to SH3 domains, such as those of
Src and Yes, much more avidly than the 7-mer, RPLPPLP (SEQ ID
NO: 9). The 13-mer ANPSPATRPLPTR (SEQ ID NO: 66) has been
shown to have binding affinities comparable to the core
consensus sequence. In each case, the 13-mers comprise a 7-
mer "core" sequence plus additional amino acid residues
flanking same, some of which additional amino acid residues
are contributed by the viral flanking sequences.

Thus, in one embodiment of the present invention, a 7-
mer core includes a consensus motif of the formula RXLPφφP
(SEQ ID NO: 71), wherein R is arginine, L is leucine, P is
proline, X represents any amino acid except cysteine and φ
represents a hydrophobic amino acid residue. By "hydrophobic
amino acid residue," the applicants mean to include F, Y, W,
V, A, I, L, P or M, each letter representing the standard
one-letter designation for the corresponding amino acid
residue.
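
The 7-mer consensus formula just defined lends itself to a
simple pattern search. The sketch below is only an
illustration, written as a Python regular expression under
the stated definitions (X is any of the 19 standard residues
other than cysteine; the hydrophobic class is F, Y, W, V, A,
I, L, P or M). The names ANY_EXCEPT_CYS, HYDROPHOBIC,
CORE_7MER and find_core are hypothetical, and glycine is
deliberately left out of the hydrophobic class here because
the paragraph above does not list it.

```python
import re

# Residue classes taken from the definitions in the text above.
ANY_EXCEPT_CYS = "ADEFGHIKLMNPQRSTVWY"  # X: any standard residue but C
HYDROPHOBIC = "FYWVAILPM"               # hydrophobic class as listed

# R X L P (hydrophobic) (hydrophobic) P
CORE_7MER = re.compile(f"R[{ANY_EXCEPT_CYS}]LP[{HYDROPHOBIC}]{{2}}P")


def find_core(peptide: str):
    """Return the first 7-mer matching the consensus, or None."""
    match = CORE_7MER.search(peptide.upper())
    return match.group(0) if match else None


if __name__ == "__main__":
    for seq in ("RSTPRPLPPLPTTR", "VLKRPLPIPPVTR", "GPHRRLPPTPATR"):
        print(seq, find_core(seq))
```

Under these definitions RSTPRPLPPLPTTR and VLKRPLPIPPVTR
contain a strictly complying core, whereas GPHRRLPPTPATR does
not, because a threonine occupies the sixth position.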

Furthermore, a preferred 9-mer peptide comprising two
additional amino acids on the C-terminal end of the core
sequence is envisioned having a consensus motif of the
formula RXLPφφPXψ. In this preferred 9-mer consensus motif,
the symbol ψ represents a hydrophilic amino acid residue,
except cysteine. By "hydrophilic amino acid residue," the
applicants mean to include K, R, H, D, E, N, Q, T, S or C,
and the other symbols are as defined above. For the purposes
of the present invention, a glycine residue (G) may be
considered either a hydrophobic or a hydrophilic amino acid
residue. The one-letter symbols B and Z, which stand for N
or D and Q or E, respectively, are considered hydrophilic
amino acid residues.

Particular 13-mer peptides of the present invention
include those listed below. It is noted, however, that not
all the following 13-mer peptides correlate strictly to or
comply with the preferred 9-mer consensus motif, described
above. Those peptides that do not comply (indicated in
italics, with the non-complying amino acid residues
underscored) can, thus, be described as "resembling" those
that do comply (indicated in normal type) with the preferred
9-mer consensus motif: PGFRELPPLPPSR (SEQ ID NO: 72),
AQSRPLPIPPETR (SEQ ID NO: 73), VLKRPLPIPPVTR (SEQ ID NO: 64),
PPN&PLPPLPTHL (SEQ ID NO: 74), TGRQPLPPLPNDS (SEQ ID NO: 75),
YSTRPi^PPJXRtS (SEQ ID NO: 76), SHK&RLPPLPTRP (SEQ ID NO: 77),
YRFRALP&PPSA^ (SEQ ID NO: 78), GPHRRLPPTPATR (SEQ ID NO: 65),
LAQROLPPTPORD (SEQ ID NO: 79), ALQRRLPB^PPEA (SEQ ID NO: 80),
PATRPLPXBPSRT (SEQ ID NO: 81), YSTRPLPSBPSRT (SEQ ID NO: 82),
XPGRILLftPSBER (SEQ ID NO: 83), SGGZL^PVPPPN (SEQ ID NO: 84),
RSTRPLPILPRTT (SEQ ID NO: 85), STPRPLPMLPTTR (SEQ ID NO: 86),
STNRPLPMIPTTR (SEQ ID NO: 87), RSTRPLPS.LPITT (SEQ ID NO: 88),
STSRPLPSXJPTTR (SEQ ID NO: 89), RSTRSLPPLPPTT (SEQ ID NO: 90),
RSTRQUPIPPTTT (SEQ ID NO: 91), STPRPLPLIPTTP (SEQ ID NO: 92),
RSTRPLPPTPI'TT (SEQ ID NO: 93), and RSTRPQPPPPITT (SEQ ID
NO: 94). Accordingly, other peptides not specifically
disclosed, which either comply with or "resemble" the
preferred 9-mer consensus motif, can be readily envisioned by
those of ordinary skill in the art and are considered to be
equivalent to those that are specifically disclosed above.
In particular, non-compliance at positions 1 (S, G, and I, in
place of R, are tolerated), 3 (V, A, and Q, in place of L,
are tolerated), 4 (L, in place of P, is tolerated), 5
(hydrophilic amino acid residues, S, R, and T, are tolerated
in place of a hydrophobic amino acid residue), 6 (hydrophilic
amino acid residues, R and T, are tolerated in place of a
hydrophobic amino acid residue), 7 (T and S, in place of P,
are tolerated), and 9 (P and A are tolerated in place of a
hydrophilic amino acid residue) has been observed.
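
The position-by-position tolerances listed above can be
captured in a small lookup table. The following sketch is an
illustration only: it checks a 9-residue window against the
preferred 9-mer consensus motif and reports, for each
non-complying position, whether the substitution is among
those stated above to have been observed. The names ALLOWED,
TOLERATED and deviations are hypothetical. Glycine is placed
in both the hydrophobic and hydrophilic classes, and B and Z
in the hydrophilic class, following the definitions given
earlier in this section; cysteine is excluded throughout.

```python
# Sketch: compare a 9-mer with the preferred consensus, position by position.
# Class definitions follow the text; all identifiers are hypothetical.

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")
NOT_CYS = STANDARD - {"C"}
HYDROPHOBIC = set("FYWVAILPM") | {"G"}        # G may count as either class
HYDROPHILIC = set("KRHDENQTS") | set("GBZ")   # cysteine excluded

# Residues that comply strictly at each of the nine positions.
ALLOWED = {
    1: {"R"}, 2: NOT_CYS, 3: {"L"}, 4: {"P"},
    5: HYDROPHOBIC, 6: HYDROPHOBIC, 7: {"P"},
    8: NOT_CYS, 9: HYDROPHILIC,
}

# Non-complying residues reported in the text as tolerated.
TOLERATED = {
    1: set("SGI"), 3: set("VAQ"), 4: {"L"},
    5: set("SRT"), 6: set("RT"), 7: set("TS"), 9: set("PA"),
}


def deviations(nine_mer: str):
    """List (position, residue, was_observed_as_tolerated) for mismatches."""
    seq = nine_mer.upper()
    if len(seq) != 9:
        raise ValueError("expected exactly nine residues")
    found = []
    for pos, res in enumerate(seq, start=1):
        if res not in ALLOWED[pos]:
            found.append((pos, res, res in TOLERATED.get(pos, set())))
    return found


if __name__ == "__main__":
    print(deviations("RPLPPLPTT"))  # []
    print(deviations("RRLPPTPAT"))  # [(6, 'T', True)]
```

An empty list means strict compliance; a tuple whose last
element is False marks a substitution that the text does not
report as having been observed.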

5.3.1. Binding Specificities

It has been discovered that certain of the
binding peptides disclosed have a greater relative binding
affinity for one SH3 domain over another. Referring now to
Figure 8, the relative binding affinities of the various
peptides described above toward different SH3 domain targets
are graphically presented. As one can see, the relative
binding affinities of the respective peptides can differ by
orders of magnitude. Thus, as shown in Figure 8, the peptide
GPHRRLPPTPATR (SEQ ID NO: 65), having the relevant sequence of
the phage clone identified as T12.SRC3.3, is specific to Src
family SH3 domains, including, but not limited to, Src, Yes,
Lck, Hck, Fgr, Fyn, and Lyn. This SH3 binding peptide has
little affinity for SH3 domains derived from PLCγ or Grb2.
On the other hand, the peptide GILAPPVPPRNTR (SEQ ID NO: 63),
corresponding to the relevant sequence of the phage clone
T12.SRC3.1, which is one of the most abundant binding clones
found by the present method, binds generically to a broad
range of SH3 domains, including Src, PLCγ, and Grb2.

On an intermediate level, the present invention has also
uncovered a peptide, VLKRPLPIPPVTR (SEQ ID NO: 64),
corresponding to the relevant sequence of the phage clone
T12.SRC3.6, which is Src preferential; that is, this peptide
exhibits strong binding affinities for members of the Src
family, some binding affinities for Grb2 proteins, but little
binding affinities for PLCγ domains. The peptide
ANPSPATRPLPTR (SEQ ID NO: 66), corresponding to the relevant
sequence of the phage clone T12.SRC3.2, also exhibits Src
family specificity similar to GPHRRLPPTPATR (SEQ ID NO: 65).
The peptides RSTPRPLPMLPTTR (R8C.YES3.5; SEQ ID NO: 62) and
RSTPRPLPPLPTTR (representative consensus motif; SEQ ID NO: 67)
are highly specific for the SH3 domains of Src, Yes, and other
Src-related proteins.

5.4. Further Discussion of Binding Experiments

At the outset it is apparent that the binding
affinity of certain peptides to the SH3 domain of Src and
Src-related proteins is governed by more than just the
presence of the preferred core consensus sequences, RPLPPLP
(SEQ ID NO: 9) or RPLPMLP (SEQ ID NO: 95; i.e., RPLP(P/M)LP,
SEQ ID NO: 96). Thus, while the synthetic peptides
RSTPRPLPMLPTTR (R8C.YES3.5; SEQ ID NO: 62) and RSTPRPLPPLPTTR
(consensus; SEQ ID NO: 67) exhibit a strong specific binding
affinity for Src SH3, the other synthetic peptides tested
also exhibited an avid binding affinity to SH3 domains
relative to the 7-mer, RPLPPLP (SEQ ID NO: 9). These other
peptides, GILAPPVPPRNTR (SEQ ID NO: 63), VLKRPLPIPPVTR (SEQ ID
NO: 64), GPHRRLPPTPATR (SEQ ID NO: 65), and ANPSPATRPLPTR (SEQ
ID NO: 66), sport core sequences and flanking sequences that
do not closely adhere to the preferred core consensus
sequences. Thus, these results suggest that binding affinity
to SH3 domains is governed to a large extent by the nature of
the amino acid residues flanking the core 7-mer sequence.

The binding characteristics of Src SH3-selected peptides
were determined using synthetic biotinylated peptides
corresponding to the sequences displayed by Src SH3-selected
phage. These biotinylated peptides were assayed for direct
binding to immobilized Src SH3-GST. Each of the five
library-derived peptides tested was found to bind to Src
SH3-GST and Yes SH3-GST over background (Figure 8).
Furthermore, a strong correlation was observed between the
similarity of a given peptide to the preferred core consensus
sequence RPLP(P/M)LP and the peptide's affinity for Src SH3-
GST. The core sequence of the clone T12.SRC3.1
(GILAPPVPPRNTR; SEQ ID NO: 63) appears to provide more generic
SH3 domain-binding characteristics.

Experiments comparing the relative binding of various
phage clones to SH3 domains taken from a variety of proteins
demonstrated the preference of these clones for Src and Src-
related SH3 domains over SH3 domains taken from other
proteins.

It was further found that while the 7-mer having the
consensus sequence RPLPPLP (SEQ ID NO: 9) bound to Src SH3-GST
only weakly, peptides comprising the consensus sequence
flanked by residues encoded by one of the Src SH3-selected
clones (R8C.YES3.5), RSTP (SEQ ID NO: 97) at the N-terminal
end and TTR at the C-terminal end, bound significantly better
than any of the peptides tested (Figure 7). Thus, as stated
previously, sequences that flank the RPLP(P/M)LP (SEQ ID
NO: 96) core appear to be important contributors to SH3
binding. It is further surmised that a peptide having or
resembling the sequence RSTPAPPVPPRTTR (SEQ ID NO: 98) should
exhibit strong but generic binding to a variety of SH3
domains.
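
The way the flanked peptides discussed above are composed
(an N-terminal RSTP flank, a 7-mer core, and a C-terminal TTR
flank) can be written out as a short sketch. The helper name
with_flanks is hypothetical, and treating APPVPPR as the
"core" of the T12.SRC3.1 peptide GILAPPVPPRNTR is an
assumption made here only to show how SEQ ID NO: 98 relates
to the other sequences; the text does not state this
explicitly.

```python
# Sketch: assemble flanked peptides from a core and the R8C.YES3.5 flanks.
# The flank sequences come from the text; the helper itself is hypothetical.

N_FLANK = "RSTP"  # N-terminal flank (SEQ ID NO: 97)
C_FLANK = "TTR"   # C-terminal flank


def with_flanks(core: str, n_flank: str = N_FLANK, c_flank: str = C_FLANK) -> str:
    """Return the core sequence with the given flanks attached."""
    return n_flank + core + c_flank


if __name__ == "__main__":
    print(with_flanks("RPLPPLP"))  # RSTPRPLPPLPTTR (SEQ ID NO: 67)
    print(with_flanks("RPLPMLP"))  # RSTPRPLPMLPTTR (SEQ ID NO: 62)
    print(with_flanks("APPVPPR"))  # RSTPAPPVPPRTTR (SEQ ID NO: 98)
```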

Similarly, it is observed that most of the Src SH3-
binding motifs are located near the carboxy-terminus of the
random peptides, adjacent to sequences which are fixed in
every clone (Figure 5). The exceptional clones tend to
possess sequences that resemble motifs that include fixed
flanking sequences. This clustering contrasts with previous
results, in which binding motifs are distributed throughout
the random peptide. Kay, B.K., et al., in Gene (1993)
128:59-65.

The binding of the library-derived Src SH3-binding
peptides was compared to that of peptides corresponding to
proline-rich regions of natural proteins. Peptides
corresponding to SH3-binding regions in human PI-3' Kinase
(KISPPTPKPRPPRPLPV; SEQ ID NO: 69) and human SOS1.20
(GTVEPVPPPVPPRRRPESA; SEQ ID NO: 68), as well as a proline-
rich region of the cytoskeletal protein vinculin
(LAPPKPPLPEGEV; SEQ ID NO: 70), bound Src SH3 less well than
the library-derived peptides (Figure 7).

As mentioned above, the relative specificity of binding
was explored. Thus, the relative binding of Src SH3-selected
peptides to equal amounts of GST fusions to SH3 domains from
different proteins was determined (Figure 8). While all of
the library-derived peptides bound the Src and Yes SH3
domains almost equally well, none of the peptides (with the
exception of peptide T12.SRC3.1, the most divergent peptide
tested) bound the SH3 domains of Grb2, Crk, Abl or PLCγ1
appreciably. Thus, the library-derived peptides, in contrast
with a peptide derived from SOS1, exhibit SH3 binding that is
relatively specific for Src-family members.

Next, it was determined whether the binding to the Src
SH3 domain was qualitatively like the interactions of the SH3
domain and natural proteins found in cell lysates. Thus,
radiolabeled proteins were prepared from NIH 3T3 cell lysates
and chromatographed over Src SH3-GST immobilized on
glutathione-linked Sepharose. SDS-PAGE shows that a number
of proteins can be affinity purified in this manner. The
synthesized peptides bind quite well to the Src SH3 domain,
as they can compete with the binding of radiolabeled proteins
from cell lysates to immobilized Src-GST, with an IC50 of
1-10 mM (Figure 9). In conclusion, the peptides can
efficiently block the interaction of cellular proteins with
Src SH3 in vitro.

Moreover, Xenopus laevis oocytes injected with mRNA
encoding constitutively active Src undergo progesterone-
induced maturation at an accelerated rate relative to oocytes
injected with water or c-Src mRNA. Unger, T.F. and Steele,
R.E. in Mol. Cell. Biol. (1992) 12:5485-5498. To explore the
ability of the library-derived Src SH3-binding peptides to
exert a biochemical effect in vivo, the influence of the
peptides on the maturation of Xenopus laevis oocytes was
examined. Hence, stage VI oocytes were injected with
peptide, exposed to progesterone, and scored for germinal
vesicle breakdown. Figure 10 shows that the rate of
maturation was accelerated by approximately one hour when
oocytes were injected with the SH3-binding peptide consisting
of RPLPPLP (SEQ ID NO: 9) flanked by residues from clone
T12.SRC3.6 (VLKRPLPIPPVTR; SEQ ID NO: 64), but not with water
or a peptide corresponding to a proline-rich segment of
vinculin (LAPPKPPLPEGEV; SEQ ID NO: 70) as controls. The
magnitude of this effect is roughly equivalent to that seen
with injection of mRNA encoding constitutively active Src.
See, e.g., Figure 3B in Unger, T.F. and Steele, R.E., supra.
This result suggests that the library-derived Src SH3-binding
peptide is effectively relieving an inhibitory effect of the
Src SH3 domain upon Src PTK activity. This model is
consistent with a number of studies which have demonstrated
an inhibitory effect of the Src SH3 domain upon Src kinase
and transforming activity. See, e.g., Okada, M., et al.,
supra; Murphy, S.M., et al., supra; and Superti-Furga, G., et
al., supra.

5.5. Diagnostic And Therapeutic Agents
Based On SH3 Binding Peptides and Additional
Methods of Their Use

As already indicated above, the present invention
also seeks to provide diagnostic, prophylactic, and
therapeutic agents based on the SH3 binding peptides
described herein.

In one embodiment, diagnostic agents are provided,
preferably in the form of kits, comprising an SH3 domain-
binding peptide and a detectable label conjugated to said
peptide directly, indirectly or by complexation, said peptide
comprising: (i) a core sequence motif of the formula RXLPφφP
(SEQ ID NO: 71), wherein X represents any amino acid except
cysteine and φ represents a hydrophobic amino acid residue,
including F, Y, W, V, A, I, L, P, M or G, each letter
representing the standard one-letter designation for the
corresponding amino acid residue; and (ii) two or more
additional amino acid residues flanking said core sequence at
its C-terminal end, N-terminal end or both.

The diagnostic agents of the present invention can be
used to detect the presence of SH3 domains of a generic or
specific type in cells, tissues or organs either in vitro or
in vivo. For in vivo applications, the diagnostic agent is
preferably mixed with a pharmaceutically acceptable carrier
for administration, either enterally, parenterally or by some
other route dictated by the needs of the particular
application.

In a particular embodiment, for example, an assay based
on a fusion product is contemplated which comprises a Src SH3
domain-binding peptide of the invention and a substrate for
deregulated or "activated" Src. For instance, a muscle
biopsy, taken from a subject suspected of being infected by
the Rous sarcoma virus, can be treated with an effective
amount of the fusion product. By subsequent analysis of the
degree of conversion of the substrate, one can potentially
detect infection by the Rous sarcoma virus in the subject,
particularly mammals, especially chickens. The presence of
the retrovirus, which causes the expression of deregulated or
"activated" Src, may thus be indicated by unusually high
levels of Src as revealed by large amounts of the converted
substrate. See, for example, Paxton, W.G. et al., in
Biochem. Biophys. Res. Commun. (1994) 200(1):260-267
(detection of phosphorylated tyrosine and serine residues of
angiotensin II AT1 receptor, a substrate of Src family
tyrosine kinases); another suitable substrate may be the
protein p68 (Fumagalli, S. et al., in Nature (1994)
368(6474):871-874; Taylor, S.J. and Shalloway, D., in Ibid.
at 867-871).

Alternatively, the enzyme can be isolated by selective
binding to a form of the SH3 domain-binding peptides of the
present invention (e.g., biotin-peptide conjugate). After
isolation of the protein-peptide conjugate complex (e.g., on
a column comprising streptavidin), the activity of the enzyme
can then be assayed by conventional methods to determine its
level of protein kinase activity which can be taken as an
indication of the presence of the deregulated or "activated"
form of the enzyme. An assay for Src kinase has been
described by Klinz and Maness, in Neuroprotocols (a companion
to Neuroscience) (1992) 1(3):224-231.

Moreover, the diagnostic agents of the invention can
also serve as imaging agents of cells, tissues or organs,
especially those that contain proteins with an SH3 domain.
For example, neural cells (e.g., neurons, other areas of the
brain), osteoclasts, osteoblasts, platelets, immune cells,
and other dividing cells are known to express or contain
proteins with SH3 domains. Thus, an image can be taken of
portions of the body to serve as a baseline for subsequent
images to detect physiologic or biochemical changes in the
subject's body. For instance, changes in the condition of
cellular levels of Src or a transformation of the cellular
Src to an "activated" form may be detected using the
diagnostic or imaging agents of the present invention.

Accordingly, it has been demonstrated that an SH3-
binding peptide tagged with a fluorescence emitter can
provide an image of the cytoskeleton. The images are
presented in Figure 11. As can be seen from Figure 11,
panels A, B, and C show the fluorescence image that is
obtained on treating NIH 3T3 fibroblasts with SH3 domain-
binding peptides modified to include a fluorescent tag. In
sharp contrast, panel D shows only a dark image that is
produced when the cells are treated with a proline-rich
segment of vinculin as a control.

In another embodiment, an SH3 domain-binding peptide-
horseradish immunoperoxidase complex or related
immunohistochemical agent could be used to detect and
quantitate specific receptor molecules in tissues, serum or
body fluids. In particular, the present invention provides
useful diagnostic reagents for use in immunoassays, Southern
or Northern hybridization, and in situ assays. Accordingly,
the diagnostic agents described herein may be suitable for
use in vitro or in vivo.

In addition, the diagnostic or imaging agent of the
present invention is not limited by the nature of the
detectable label. Hence, the diagnostic agent may contain
one or more such labels including, but not limited to,
radioisotopes, fluorescent tags, paramagnetic substances,
heavy metals, or other image-enhancing agents. Those of
ordinary skill in the art would be familiar with the range of
labels and methods to incorporate or conjugate them into the
SH3 domain-binding peptide to form diagnostic agents.

In yet a further embodiment, pharmaceutical compositions
are provided comprising an SH3 domain-binding peptide and a
pharmaceutically acceptable carrier. In a specific
embodiment of the invention, the pharmaceutical composition
is useful for the modulation of the activity of SH3 domain-
containing proteins. By "modulation" is meant either
inhibition or enhancement of the activity of the protein
target. Accordingly, a pharmaceutical composition is
disclosed comprising an SH3 domain-binding peptide and a
pharmaceutically acceptable carrier, said peptide comprising:
(i) a 9-mer sequence motif of the formula RXLPφφPXψ (SEQ ID
NO: 10), wherein X represents any amino acid except cysteine,
φ represents a hydrophobic amino acid residue, and wherein ψ
is a hydrophilic amino acid residue except cysteine, each
letter representing the standard one-letter designation for
the corresponding amino acid residue; and, optionally, (ii)
additional amino acid residues flanking the 9-mer sequence at
its C-terminal end, N-terminal end or both, up to a total of
45 amino acid residues, including said 9-mer sequence.
Preferably, the peptide comprises at least one, more
preferably at least two, and most preferably at least three
additional amino acids flanking the 9-mer sequence.
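
The 9-mer motif and the 45-residue length limit described above can be checked mechanically. The following is a minimal sketch only: the hydrophobic and hydrophilic residue sets are assumptions made for illustration (this passage does not enumerate them), and the name find_motif is hypothetical.

```python
import re

# "X": any of the 20 standard residues except cysteine.
X_CLASS = "ADEFGHIKLMNPQRSTVWY"
# Assumed hydrophobic set; not enumerated in this passage.
HYDROPHOBIC = "AVLIMFWYP"
# Assumed hydrophilic set; cysteine is excluded, as the motif requires.
HYDROPHILIC = "STNQDEKRH"

# R-X-L-P-(hydrophobic)-(hydrophobic)-P-X-(hydrophilic)
MOTIF = re.compile(
    f"R[{X_CLASS}]LP[{HYDROPHOBIC}]{{2}}P[{X_CLASS}][{HYDROPHILIC}]"
)

MAX_PEPTIDE_LENGTH = 45  # total length, including the 9-mer itself


def find_motif(peptide: str):
    """Return the 0-based start of the first 9-mer match, or None.

    Peptides longer than 45 residues are rejected outright.
    """
    if len(peptide) > MAX_PEPTIDE_LENGTH:
        return None
    match = MOTIF.search(peptide.upper())
    return match.start() if match else None
```

For example, under these assumed residue classes find_motif("GGRPLPPLPTSGG") reports the motif at offset 2, leaving two flanking residues on each side.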

As stated above, the therapeutic or diagnostic agents of
the invention may also contain appropriate pharmaceutically
acceptable carriers, diluents and adjuvants. Such
pharmaceutical carriers can be sterile liquids, such as water
and oils including those of petroleum, animal, vegetable or
synthetic origin, such as peanut oil, soybean oil, mineral
oil, sesame oil and the like. Water is a preferred carrier
when the pharmaceutical composition is administered
intravenously. Saline solutions and aqueous dextrose and
glycerol solutions can also be employed as liquid carriers,
particularly for injectable solutions. Suitable
pharmaceutical excipients include starch, glucose, lactose,
sucrose, gelatin, malt, rice, flour, chalk, silica gel,
magnesium carbonate, magnesium stearate, sodium stearate,
glycerol monostearate, talc, sodium chloride, dried skim
milk, glycerol, propylene glycol, water, ethanol and the
like. These compositions can take the form of solutions,
suspensions, tablets, pills, capsules, powders, sustained-
release formulations and the like. Suitable pharmaceutical
carriers are described in "Remington's Pharmaceutical
Sciences" by E.W. Martin.

Such compositions will contain an effective therapeutic
amount of the active compound together with a suitable amount
of carrier so as to provide the form for proper
administration to the subject. While intravenous injection
is a very effective form of administration, other modes can
be employed, including but not limited to intramuscular,
intraperitoneal, and subcutaneous injection, and oral, nasal,
enteral, and parenteral administration.

The therapeutic agents and diagnostic agents of the
instant invention are used for the treatment and/or diagnosis
of animals, and more preferably, mammals including humans, as
well as dogs, cats, horses, cows, pigs, guinea pigs, mice and
rats. Accordingly, other methods contemplated in the present
invention include, but are not limited to, a method of
modulating, i.e., inhibiting or enhancing, bone resorption in
a mammal (see, e.g., Hall, T.J., in Biochem. Biophys. Res.
Commun. (1994) 199(3):1237-44), a method of disrupting
protein tyrosine kinase-mediated signal transduction pathways
or a method of regulating the processing, trafficking or
translation of RNA in a cell by introducing or administering
an effective amount of an SH3 domain-binding peptide of the
present invention (see, e.g., Taylor, S.J. and Shalloway, D.,
supra).

The diagnostic or therapeutic agents of the present
invention can be modified by attachment to soluble
macromolecules such as proteins, polysaccharides, or
synthetic polymers. For example, the peptide could be
coupled to styrene-maleic acid copolymers (see, e.g.,
Matsumura and Maeda, Cancer Res. (1986) 46:6387),
methacrylamide copolymers (Kopecek and Duncan, J. Controlled
Release (1987) 6:315), or polyethylene glycol (PEG) (e.g.,
Hershfield and Buckley, N. Engl. J. Med. (1987) 316:589; Ho
et al., Drug Metab. Dispos. (1986) 14:349; Chua et al., Ann.
Intern. Med. (1988) 109:114). The agents, if desired,
are further targeted by attachment to an antibody, especially
a monoclonal antibody. Such antibodies include but are not
limited to chimeric, single chain, Fab fragments, and Fab
expression libraries. In one embodiment the agent is coupled
to the macromolecule via a degradable linkage so that it will
be released in vivo in its active form.

In another embodiment, the therapeutic or diagnostic
agent may be delivered in a vesicle, in particular a
liposome. See, Langer, Science (1990) 249:1527-1533; Treat
et al., in Liposomes in the Therapy of Infectious Disease
and Cancer, Lopez-Berestein and Fidler (eds.), Liss, New York
(1989) pp. 353-365; Lopez-Berestein, ibid., pp. 317-327.

In yet another embodiment, the therapeutic or in vivo
diagnostic agent can be delivered in a controlled release
system. In one embodiment, a pump may be used (see Langer,
supra; Sefton, CRC Crit. Ref. Biomed. Eng. (1987) 14:201;
Buchwald et al., Surgery (1980) 88:507; Saudek et al., N.
Engl. J. Med. (1989) 321:574). In another embodiment,
polymeric materials may be used (see Medical Applications of
Controlled Release, Langer and Wise (eds.), CRC Press, Boca
Raton, Florida, 1974; Controlled Drug Bioavailability, Drug
Product Design and Performance, Smolen and Ball (eds.), Wiley,
New York 1984; Ranger and Peppas, J. Macromol. Sci. Rev.
Macromol. Chem. (1983) 23:61; see, also, Levy et al., Science
(1985) 228:190; During et al., Ann. Neurol. (1989) 25:351;
Howard et al., J. Neurosurg. (1989) 71:105). In a preferred
embodiment, a controlled release system may be placed next to
the therapeutic target, thus requiring only a fraction of the
systemic dose (see, e.g., Goodson, in Medical Applications of
Controlled Release, supra, (1984) 2:115-138). It will be
recognized by one of ordinary skill in the art that a
particular advantage of the invention is that a peptide will
not be subject to the problems of denaturation and
aggregation associated with proteins held in the warm, moist
environment of a body in a controlled release system.

Other controlled release systems are discussed in the
review by Langer, in Science (1990) 249:1527-1533.



6.

6.1. Preparation of the TSAR-9 Library

6.1.1. Synthesis and Assembly of
Oligonucleotides

Figure 1 shows the formula of the
oligonucleotides and the assembly scheme used in construction
of the TSAR-9 library. The oligonucleotides were synthesized
with an Applied Biosystems 380a synthesizer (Foster City,
CA), and the full-length oligonucleotides were purified by

Five micrograms of each of the pair of oligonucleotides
were mixed together in buffer (10 mM Tris-HCl, pH 8.3, 15 mM
KCl, 0.001% gelatin, 1.5 mM magnesium chloride), with 0.1%
Triton X-100, 2 mM dNTP's, and 20 units of Taq DNA
polymerase. The assembly reaction mixtures were incubated at
72 °C for 30 seconds and then 30 °C for 30 seconds; this
cycle was repeated 60 times. It should be noted that the
assembly reaction is not PCR, since a denaturation step was
not used. Fill-in reactions were carried out in a thermal
cycling device (Ericomp, La Jolla, CA) with the following
protocol: 30 seconds at 72 °C, 30 seconds at 30 °C, repeated
for 60 cycles. The lower temperature allows for annealing of
the six base complementary region between the two sets of the
oligonucleotide pairs. The reaction products were
phenol/chloroform extracted and ethanol precipitated.
Greater than 90% of the nucleotides were found to have been
converted to double stranded synthetic oligonucleotides.

After resuspension in 300 µL of buffer containing 10 mM
Tris-HCl, pH 7.5, 1 mM EDTA (TE buffer), the ends of the
oligonucleotide fragments were cleaved with Xba I and Xho I
(New England BioLabs, Beverly, MA) according to the
supplier's recommendations. The fragments were purified by
4% agarose gel electrophoresis. The band of correct size was
removed and electroeluted, concentrated by ethanol
precipitation and resuspended in 100 µL TE buffer.
Approximately 5% of the assembled oligonucleotides can be
expected to have internal Xho I or Xba I sites; however, only
the full-length molecules were used in the ligation step of
the assembly scheme. The concentration of the synthetic
oligonucleotide fragments was estimated by comparing the
intensity on an ethidium bromide stained gel run along with
appropriate quantitated markers. All DNA manipulations not
described in detail were performed according to Maniatis,
supra.

To demonstrate that the assembled enzyme digested
oligonucleotides could be ligated, the synthesized DNA
fragments were examined for their ability to self-ligate.
The digested fragments were incubated overnight at 18 °C in
ligation buffer with T4 DNA ligase. When the ligation
products were examined by agarose gel electrophoresis, a
concatamer of bands was visible upon ethidium bromide
staining. As many as five different unit length concatamer
bands (i.e., dimer, trimer, tetramer, pentamer, hexamer) were
evident, suggesting that the synthesized DNA fragments were
efficient substrates for ligation.

6.1.2. Construction of Vectors

The construction of the M13 derived phage
vectors useful for expressing a TSAR library has been
recently described (Fowlkes, D. et al. BioTech. (1992)
13:422-427). To express the TSAR-9 library, an M13 derived
vector, m663, was constructed as described in Fowlkes. The
m663 vector contains the pIII gene having a c-myc-epitope,
i.e., as a stuffer fragment, introduced at the mature N-
terminal end, flanked by Xho I and Xba I restriction sites
(see also, Figure I of Fowlkes).

6.1.3. Expression of the TSAR-9 Library

The synthesized oligonucleotides were then
ligated to Xho I and Xba I double-digested m663 RF DNA
containing the pIII gene (Fowlkes) by incubation with ligase
overnight at 12 °C. More particularly, 50 ng of vector DNA
and 5 ng of the digested synthesized DNA were mixed
together in 50 µL ligation buffer (50 mM Tris, pH 8.0, 10 mM
MgCl2, 20 mM DTT, 0.1 mM ATP) with T4 DNA ligase. After
overnight ligation at 12 °C, the DNA was concentrated by
ethanol precipitation and washed with 70% ethanol. The
ligated DNA was then introduced into E. coli (DH5αF'; GIBCO
BRL, Gaithersburg, MD) by electroporation.

A small aliquot of the electroporated cells was plated
and the number of plaques counted to determine that 10*
recombinants were generated. The library of E. coli cells
containing recombinant vectors was plated at a high density
(~400,000 per 150 mm petri plate) for a single amplification
of the recombinant phage. After 8 hr, the recombinant
bacteriophage were recovered by washing each plate for 18 hr
with SMG buffer (100 mM NaCl, 10 mM Tris-HCl, pH 7.5, 10 mM
MgCl2, 0.05% gelatin) and after the addition of glycerol to
50% were frozen at -80 °C. The TSAR-9 library thus formed
had a working titer of ~2 x 10" pfu/ml.

6.2. Preparation of TSAR-12 Library

Figure 2 shows the formula for the synthetic
oligonucleotides and the assembly scheme used in the
construction of the TSAR-12 library. As shown in Figure 2,
the TSAR-12 library was prepared substantially the same as
the TSAR-9 library described in Section 6.1 above with the
following exceptions: (1) each of the variant non-predicted
oligonucleotide sequences, i.e., NNB, was 30 nucleotides in
length, rather than 54 nucleotides; (2) the restriction sites
included at the 5' termini of the variant, non-predicted
sequences were Sal I and Spe I, rather than Xho I and Xba I;
and (3) the invariant sequence at the 3' termini to aid
annealing of the two strands was GCGGTG and CGCCAC rather
than CCAGGT and GGTCCA (5' to 3').

After synthesis including numerous rounds of annealing
and chain extension in the presence of dNTP's and Taq DNA
polymerase, and purification as described above in Section
6.1.1, the synthetic double stranded oligonucleotide
fragments were digested with Sal I and Spe I restriction
enzymes and ligated with T4 DNA ligase to the nucleotide
sequence encoding the M13 pIII gene contained in the m663
vector to yield a library of TSAR-expression vectors as
described in Sections 6.1.2 and 6.1.3. The ligated DNA was
then introduced into E. coli (DH5αF'; GIBCO BRL,
Gaithersburg, MD) by electroporation. The library of E. coli
cells was plated at high density (~400,000 per 150 mm petri
plate) for amplification of the recombinant phage. After
about 8 hr, the recombinant bacteriophage were recovered by
washing for 18 hr with SMG buffer and after the addition of
glycerol to 50% were frozen at -80 °C.

The TSAR-12 library thus formed had a working titer of
~2 x 10" pfu/mL.

6.3. Characterization of the TSAR-9 and -12 Libraries

The inserted synthetic oligonucleotides for each of
the TSAR libraries, described in Sections 6.1 and 6.2 above,
had a potential coding complexity of 20^36 (~10^47) and 20^20,
respectively, and since -10*^ molecules were used in each
transformation experiment, each member of these TSAR
libraries should be unique. After plate amplification the
library solution or stock has 10^ copies of each member/ml.
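
The coding complexity follows from the lengths of the variant regions given in Section 6.2 (54 nucleotides for TSAR-9, 30 for TSAR-12). The sketch below works through that arithmetic; it assumes each insert carries two such variant regions (one per oligonucleotide of the assembled pair), which is an inference from the assembly scheme and not a figure stated in this passage.

```python
import math


def random_residues(arm_nucleotides: int, arms: int = 2) -> int:
    """Number of variable codons across all variant regions of one insert.

    `arms` (the number of variant regions per insert) is an assumption.
    """
    return (arm_nucleotides // 3) * arms


def coding_complexity(n_residues: int) -> int:
    """Number of distinct peptides if each position may hold any of 20 residues."""
    return 20 ** n_residues


for name, arm_nt in (("TSAR-9", 54), ("TSAR-12", 30)):
    n = random_residues(arm_nt)
    exponent = math.log10(coding_complexity(n))
    print(f"{name}: {n} variable residues, 20^{n} is about 10^{exponent:.1f}")
```

Because either figure is far larger than the number of transformants that can be recovered, any clone actually obtained is expected to be unique, which is the point made in the text.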

It was observed that very few (<10%) of the inserted
oligonucleotide sequences characterized so far in both of the
libraries have exhibited deletions or insertions. This is
likely a reflection of the accuracy of assembling the
oligonucleotides under the conditions used and the fact that
certain types of mutations (i.e., frame-shifts) would not be
tolerated, as pIII is an essential protein for phage
propagation.

In order to determine whether any coding bias existed in
the variant non-predicted peptides expressed by these
libraries, perhaps due to biases imposed in vitro during
synthesis of the oligonucleotides or in vivo during
expression by the reproducing phage, inserts were sequenced
as set forth below.

6.3.1. Characterization of TSAR-9 Library

Inserted synthetic oligonucleotide fragments
of 23 randomly chosen isolates were examined from the TSAR-9
library. Individual plaques were used to inoculate 1 ml of
2XYT broth containing E. coli (DH5αF') cells and the cultures
were allowed to grow overnight at 37 °C with aeration. DNA
was isolated from the culture supernatants according to
Maniatis, supra. Twenty-three individual isolates were
sequenced according to the method of Sanger (Proc. Natl.
Acad. Sci. USA (1977) 74:5463-5467) using as a primer the
oligonucleotide 5'-AGCGTAACGATCTCCCG (SEQ ID NO. 99), which
is 89 nucleotides downstream of the pIII gene cloning site of
the m663 vector used to express the TSARs.

Nucleotide sequences and their encoded amino acid
sequences were analyzed with the MacVector computer program
(IBI, New Haven, CT). The Microsoft EXCEL program was used
to evaluate amino acid frequencies. Such analyses showed
that most nucleotide codons, and hence most amino
acids, occurred at the expected frequency in the TSAR-9
library of expressed proteins. The notable exceptions were
glutamine and tryptophan, which were over- and under-
represented, respectively.

It is of interest to note the paucity of TAG stop codons
in the inserts, i.e., only 2 of ~200 isolates characterized
contained a TAG stop codon. About half [1-(47/48)^36] of the
phage inserts were expected to have at least one TAG codon in
view of the assembly scheme used. However, most of the TAG-
bearing phage appear to have been lost from the library, even
though the bacterial host was supE. This may be a
consequence of suppression being less than 100% effective.
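
The "about half" estimate can be reproduced by enumerating the NNB codons and asking how often at least one stop codon appears in an insert. This is an illustrative sketch: the figure of 36 variable codons per TSAR-9 insert is an assumption derived from two 54-nucleotide variant regions, and the function names are hypothetical.

```python
from itertools import product

STOP_CODONS = {"TAA", "TAG", "TGA"}


def nnb_codons():
    """All codons of the form NNB: N is A, C, G or T; B is C, G or T."""
    return ["".join(c) for c in product("ACGT", "ACGT", "CGT")]


def fraction_with_stop(n_codons: int) -> float:
    """Probability that an insert of n_codons NNB codons has at least one stop."""
    codons = nnb_codons()
    p_stop = sum(c in STOP_CODONS for c in codons) / len(codons)
    return 1 - (1 - p_stop) ** n_codons


# Assumed: 36 variable codons per TSAR-9 insert.
print(f"{fraction_with_stop(36):.3f}")
```

Of the 48 NNB codons only TAG is a stop codon, since TAA and TGA end in A, so the per-codon probability is 1/48 and the expected fraction of TAG-bearing inserts is 1-(47/48)^36.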

The amino acids encoded by the inserted double stranded
synthesized oligonucleotide sequences, excluding the fixed
PG-encoding centers, were concatenated into a single sequence
and the usage frequency determined for each amino acid using
the Microsoft EXCEL program. These frequencies were compared
to that expected from the assembly scheme of the
oligonucleotides, and the divergence from expected values
represented by the size of the bars above and below the
baseline. Chi square analysis was used to determine the
significance of the deviations. The majority of amino acids
were found to occur at the expected frequency, with the
notable exceptions that glutamine and tryptophan were
somewhat over- and under-represented, respectively. Thus,
except for the invariant Pro-Gly, any position could have any
amino acid; hence, the sequences are unpredicted or random.
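
The comparison described above, observed amino acid usage against the usage expected from the NNB coding scheme, can be sketched as follows. This is not the spreadsheet analysis used in the work itself; it is a minimal stand-in that derives expected frequencies from the standard genetic code and computes the chi-square statistic. The function names are hypothetical, and the TAG codon is tallied as "*" rather than as a suppressed residue.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order; "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): AMINO[i]
    for i, codon in enumerate(product(BASES, repeat=3))
}


def expected_frequencies(third_base: str = "CGT") -> dict:
    """Expected residue frequencies when the third codon base is restricted.

    The default "CGT" corresponds to the NNB scheme.
    """
    codons = [a + b + c for a in BASES for b in BASES for c in third_base]
    freqs = {}
    for codon in codons:
        residue = CODON_TABLE[codon]
        freqs[residue] = freqs.get(residue, 0.0) + 1 / len(codons)
    return freqs


def chi_square(observed: dict, expected: dict) -> float:
    """Chi-square statistic: sum over residues of (O - E)^2 / E."""
    total = sum(observed.values())
    return sum(
        (observed.get(residue, 0) - total * f) ** 2 / (total * f)
        for residue, f in expected.items()
    )
```

The statistic would then be compared against a chi-square distribution with one fewer degree of freedom than the number of residue categories; under NNB, tryptophan and methionine are each expected at 1/48 and serine at 5/48.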

6.3.2. Characterization of TSAR-12 Library

Approximately 10 randomly chosen inserted
oligonucleotides from the TSAR-12 library were examined by
DNA sequencing as described above in Section 6.3.1. The
isolates were chosen at random from the TSAR-12 library and
prepared for sequencing, as were the TSAR-9 isolates.
Analysis showed that except for the invariant Gly any
position could have any amino acid; hence, the sequences are
unpredicted or random.

6.4. Preparation of R8C Library

Referring now to Figure 3, two oligonucleotides
were synthesized on an Applied Biosystems Model 380a machine
with the sequence 5** 

TGACGTCTCGAGTTGTMNKMNIOINKNNKNNiaiNiaW^^ ' 
(SEQ ID NO: 31) and 5'-GATCCTTCTAGATCC-3' (SEQ ID NO: 32),
where N is an equimolar ratio of deoxynucleotides A, C, G,
and T, and K is an equimolar ratio of G and T. Fifty pmol of
each oligonucleotide was incubated at 42 °C for 5 min, then
37 °C for 15 min, in 50 µL of Sequenase™ buffer (U.S.
Biochemicals, Cleveland, OH) with 0.1 µg/µL acetylated BSA,
and 10 mM DTT. After annealing, 10 units of Sequenase™ (U.S.
Biochemicals) and 0.2 mM of each dNTP were added and
incubated at 37 °C for 15 min. The sample was then heated at
65 °C for 2 hr, digested with 100 units of both Xho I and Xba
I (New England BioLabs, Beverly, MA), phenol extracted,
ethanol precipitated, and resolved on a 15% non-denaturing
polyacrylamide gel. The assembled, digested fragment was gel
purified prior to ligation. The vector, m663 (Fowlkes, D. et
al. BioTech. (1992) 13:422-427), was prepared by digestion
with Xho I and Xba I, calf alkaline phosphatase (Boehringer
Mannheim, Indianapolis, IN) treatment, phenol extracted, and
purified by agarose gel electrophoresis. To ligate, 20 µg
vector was combined with 0.2 µg insert in 3 mL with T4 DNA
ligase (Boehringer Mannheim), according to the manufacturer.
After removal of the protein and buffer by phenol extraction
and ethanol precipitation, the ligated DNA was electroporated
into XL1-Blue E. coli (Stratagene, San Diego, CA) and plated
for eight hours at 37 °C. To recover the recombinant phage,
the top agar was collected with a spatula, mixed with an
equal volume of 100 mM NaCl, 10 mM MgCl2, and 50 mM Tris-HCl
(pH 7.5), and disrupted by two passes through an 18-gauge
syringe needle. The bacterial cells were removed by
centrifugation, and phage particles were collected by
polyethylene glycol precipitation and stored at -70 °C in 25%
glycerol. The library had 10* total recombinants and a
working titer of 6 x 10^' pfu/mL.
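
The degenerate positions defined above (N for any base, K for G or T) determine which codons, and therefore which residues, the library can encode. The sketch below expands a degenerate codon and summarizes what it encodes, with NNB from the TSAR libraries included for comparison. It is an illustration only; the helper names are hypothetical and the translation uses the standard genetic code.

```python
from itertools import product

# Subset of the IUPAC nucleotide codes needed here.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "N": "ACGT",  # any base
    "K": "GT",    # G or T
    "B": "CGT",   # not A
}

BASES = "TCAG"
# Standard genetic code in TCAG order; "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): AMINO[i]
    for i, codon in enumerate(product(BASES, repeat=3))
}


def expand(degenerate: str) -> list:
    """List every concrete codon matched by a degenerate codon such as NNK."""
    return ["".join(p) for p in product(*(IUPAC[s] for s in degenerate))]


def summarize(degenerate: str) -> dict:
    """Count codons, stop codons and distinct residues for a degenerate codon."""
    translated = [CODON_TABLE[c] for c in expand(degenerate)]
    return {
        "codons": len(translated),
        "stops": translated.count("*"),
        "amino_acids": len(set(translated) - {"*"}),
    }


for scheme in ("NNK", "NNB"):
    print(scheme, summarize(scheme))
```

Both schemes reach all 20 residues while admitting only the TAG stop codon; NNK does so with 32 codons and NNB with 48.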

Members of the library were checked for inserts by the
polymerase chain reaction (Saiki, et al. Science (1988)
239:487-491). Individual plaques on a petri plate were
touched with a sterile toothpick and the tip was stirred into
2xYT with F' E. coli bacteria and incubated overnight at 37 °C
with aeration. Five microliters of the phage supernatant
were then transferred to new tubes containing buffer (67 mM
Tris-HCl, pH 8.8/10 mM β-mercaptoethanol/16.6 mM ammonium
sulfate/6.7 mM EDTA/50 µg bovine serum albumin per mL), 0.1
mM deoxynucleotide triphosphates, and 1.25 units of Taq DNA
polymerase (Boehringer Mannheim, Indianapolis, IN) with 100
pmoles of oligonucleotide primers. The primers flanked the
cloning site in gene III of m663 (5'-TTCACCTCGAAAGCAAGCTG-3'
(SEQ ID NO: 100) and
5'-CCTCATAGTTAGCGTAACG-3' (SEQ ID NO: 101)). The assembly
reactions were incubated at 94 °C for 1 min, 56 °C for 2 min,
and 72 °C for 3 min; this cycle was repeated 24 times. The
reaction products were then resolved by electrophoresis on a
NuSieve 2.0% agarose gel (FMC, Rockland, ME). Gels revealed
that for 20 plaques tested, all were recombinant and had
single inserts of the expected size.

Based on the sample size of the library, it was
anticipated that 100% of the recombinants had single inserts.
However, all of the SH3-binding phage isolated from the R8C
library had double-inserts. Such phage are presumed rare
(i.e., <5%) within the library, yet because the SH3-binding
peptide appears to need to be linear they were selected for
by our screening methods. Most likely they were formed
during the generation of the library; one scenario is that
the inserts ligated together to form head-to-head dimers and
that they were subsequently cloned into m663 DNA by ligation
with the vector's Xho I sticky end and by illegitimate
ligation with the vector's Xba I site (see Figure 4).

6.5. Preparation Of Target-coated Microtiter Wells

6.5.1. Preparation Of GST-SH3 Fusion Proteins

The preparation of Src-GST fusion protein
was first described by Smith and Johnson, in Gene (1988)
67:31, the disclosure of which is incorporated by reference
herein. Briefly, pGEX-derived (Pharmacia, Piscataway, NJ)
constructs expressing GST fusion proteins containing the SH3
domains of Src, Grb2, Crk, Abl, or PLCγ were obtained from
Dr. Channing Der (University of North Carolina at Chapel
Hill); a construct expressing the SH3 domain of Yes was
obtained from Dr. Marius Sudol (Rockefeller University). The
use of the pGEX bacterial expression vector for the
production of GST-SH3 fusion proteins is well-known to those
in the art. See, e.g., Cicchetti, P. et al., in Science
(1992) 257:803-806. Briefly, the coding region for a
particular SH3 domain can be fused in-frame at the Bam HI
site of pGEX-2T. Thus, fusion proteins were prepared as per
the manufacturer's instructions, and quantified by Coomassie
Blue staining of SDS-polyacrylamide gels. Microtiter wells
were coated with 5-20 µg GST-SH3 fusion protein in 100 mM
NaHCO3, pH 8.5, blocked with 100 mM NaHCO3 (pH 8.5) 1% BSA,
and washed. All washes consisted of five applications of
1X PBS, 0.1% Tween 20, 0.1% BSA (Buffer A). Where
appropriate, the amount of protein bound to each well was
quantified with an anti-GST antibody-based ELISA (Pharmacia,
Piscataway, NJ), and with a GST-binding phage, isolated
during the course of this work.

6.5.2. Coating of Microtiter Wells

Bacterially expressed Src SH3 glutathione S-
transferase (Src-GST) fusion protein was purified from
bacterial lysates using glutathione agarose 4B (Pharmacia),
according to the manufacturer's instructions. Bound Src-GST
fusion protein was eluted from the glutathione agarose with
10 mM glutathione in PBS. Microtiter wells were then coated
with Src-GST fusion protein (1-10 µg/well, in 50 mM NaHCO3, pH
8.5) overnight at 4 °C. To block non-specific binding of
phage, 100 µL 1% BSA in 100 mM NaHCO3, pH 8.5, was added to
each well and allowed to incubate at room temperature for 1
hour. The wells were then washed five times with 200 µL PBS,
0.1% Tween 20, 0.1% BSA (Buffer A).

6.6. Biopanning And Subsequent Characterization Of
Phage-Displayed Random Peptide Libraries
With Src-GST Fusion Protein As Target Molecule

6.6.1. Isolation of Src SH3-Binding Phage

Library screens were performed as
previously described. Kay, B.K., et al., in Gene (1993)
128:59-65. Briefly, 1 x 10" pfu TSAR 9, TSAR 12, or R8C
phage in Buffer A were incubated in a Src SH3-GST-coated well
for 2 hours. The wells were washed, and bound phage were
eluted with 100 µL 50 mM glycine-HCl (pH 2.2), transferred to
a new well, and neutralized with 100 µL 200 mM NaHPO4 (pH
7.0). Recovered phage were used to infect 1 x 10* DH5αF' E.
coli cells in 20 mL 2xYT; the infected cells were grown
overnight, resulting in a 1000- to 10,000-fold amplification
of phage titer. Amplified phage were panned twice more, as
above, excepting the amplification step. Binding phage
recovered after the third round of panning were plated at a
low density on a lawn of DH5αF' E. coli cells to yield
isolated plaques for clonal analysis. Isolated plaques were
used to produce small cultures from which phage stocks and
DNA were recovered for phage binding experiments and dideoxy
sequencing (Sanger, F., et al., in Proc. Natl. Acad. Sci. USA
(1977) 74:5463-5467), respectively. Clones were confirmed as
binding the SH3 domain by applying equal titers of phage to
wells containing Src SH3-GST or GST alone, and titering the
number of eluted particles from each well, or detecting bound
phage with an anti-phage antibody-based ELISA (Pharmacia).

Indeed, the ability of isolated phage clones to bind to
several SH3 domains derived from a variety of different
proteins can be investigated by the manner described above.
GST-SH3 fusion proteins containing SH3 domains from a variety
of different proteins are bound to microtiter wells. An
aliquot of the aforementioned phage stocks (50 µL) is
introduced into wells containing the different GST-SH3 fusion
proteins. After room temperature incubation for 1-2 hours,
the liquid contents of the microtiter plates are removed, and
the wells are washed 5 times with 200 µL Buffer A. Bound
phage are eluted with 100 µL 50 mM glycine (pH 2.2),
transferred to a new well, and neutralized with 100 µL 200 mM
NaHPO4 (pH 7.0). The phage are diluted 10*'- to 10^-fold, and
aliquots are plated onto lawns of DH5αF' E. coli cells to
establish the number of plaque forming units in the output
sample. From these experiments, the relative specificity of
different Src SH3 binding clones for SH3 domains derived from
other proteins is determined.
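
The output titer and relative specificity described above reduce to simple arithmetic on plaque counts. The sketch below shows one way to carry it out; the function names and the example figures in the usage note are hypothetical and are not data from this work.

```python
def pfu_per_ml(plaques: int, dilution_factor: float, plated_volume_ml: float) -> float:
    """Titer of the undiluted eluate in plaque forming units per mL.

    dilution_factor is the fold dilution applied before plating
    (for example 1e4 for a 10,000-fold dilution).
    """
    if plated_volume_ml <= 0:
        raise ValueError("plated volume must be positive")
    return plaques * dilution_factor / plated_volume_ml


def specificity_ratio(target_titer: float, control_titer: float) -> float:
    """Ratio of phage recovered from a target well to a control well."""
    if control_titer <= 0:
        raise ValueError("control titer must be positive")
    return target_titer / control_titer
```

As a purely illustrative case, 50 plaques from 0.1 mL of a 10,000-fold dilution correspond to 5 x 10^6 pfu/mL; comparing such a titer from a Src SH3-GST well with the titer from a GST-only well or a well coated with a different SH3 domain gives the relative specificity of a clone.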

6.6.2. Phage ELISA and Nucleotide Sequencing

To evaluate the binding of isolates to
various target proteins, enzyme-linked immunoassays (ELISA)
were also performed. Bacterial cultures were infected with
phage isolates and cultured overnight in 2XYT at 37 °C. The
cells were spun down and 25 µL of supernatant was added to
microtiter plate wells coated with 50 µL of protein (1 mg/mL
in 100 mM NaHCO3, pH 8.4; overnight at 4 °C or for a few hours
at room temperature) and blocked (1 mg/mL BSA in 100 mM
NaHCO3, pH 8.4; for about one hour). The phage are incubated
in the well with 25 µL of PBS-0.1% Tween 20 at RT for 2 hr.
The wells are then washed multiple times over 30 minutes. To
each well is added 50 µL of polyclonal anti-phage antibody
conjugated to horseradish peroxidase. The antibody is
diluted 1:3000 in BSA-PBS-Tween 20; it was obtained from
Pharmacia (Piscataway, NJ; catalog number 27-9402-01). After
30 minutes, the wells are washed again with BSA-PBS-Tween 20
for ~20 minutes. Finally, 100 µL of ABTS reagent (Pharmacia,
with H2O2) are added to each well for the development of
color. Plates are read with a plate reader (Molecular
Devices, Menlo Park, CA) at 405 nm wavelength.

The relevant segments of the
Src SH3 binding clones (or phage clones that bind to SH3
domains of other proteins) were sequenced using standard
methods. Sanger, F., et al., in Proc. Natl. Acad. Sci. USA
(1977) 74:5463-5467. The oligo primer 5'-AGCGTAACGATCTAAA-3'
(SEQ ID NO: 102) was used, which is 89 nucleotides downstream
of the gene III cloning site of M13 m666. The nucleotide
sequences were analyzed with the MacVector computer program
(IBI, New Haven, CT, USA). From this nucleotide sequence
information the primary sequence of each Src SH3 binding
peptide was deduced. The corresponding synthetic peptides
were then prepared by techniques well known in the art with
or without flanking sequences. Indeed, these synthetic
peptides have been shown to bind to SH3 domain targets, with
those possessing the phage flanking amino acid residues
exhibiting greater binding affinity.

6.7. In Vitro Peptide Binding Assays

Peptides were obtained from Research Genetics
(Birmingham, AL), Chiron Mimotopes (Victoria, Australia), or
synthesized by conventional techniques by Dr. J. Mark Carter
of Cytogen Corporation (Princeton, NJ). Peptide purity was
assessed by HPLC and/or mass spectrometry. Biotinylated
peptides were synthesized with either a KSGSG (SEQ ID NO: 103)
or a GSGS (SEQ ID NO: 104) peptide linker (a spacer) between
the biotin and the N-terminus of the peptide. Binding
experiments were performed as above, excepting the use of 10
µM peptide instead of phage. Bound biotinylated peptide was
detected with streptavidin conjugated to alkaline phosphatase
(Sigma Chemical Co., St. Louis, MO). After one hour
incubation period at room temperature, the wells were washed,
and a solution of 3 mM p-nitrophenyl-phosphate (US
Biochemicals, Cleveland, OH) in 50 mM NaCO3 (pH 9.8), and 50
mM MgCl2 was added and color allowed to develop. Signals were
read with an ELISA plate reader (Molecular Devices, Menlo
Park, CA) at 405 nm wavelength. Binding experiments were
performed in triplicate. The results are presented in
Figures 7 and 8.

6.8. Peptide Competition of GST-SH3 Affinity
Precipitations of Cell Lysates

Labeled proteins are prepared by incubating a
culture of HeLa cells overnight with >100 µCi/mL 35S-
methionine. The cells are then washed and lysed with mild
detergent. This mixture of radioactive proteins is incubated
with Src-GST fusion protein that has been immobilized on
glutathione-linked Sepharose beads (Pharmacia, Piscataway,
NJ). After several hours of tumbling, the beads are pelleted
gently by low-speed centrifugation, and the supernatant is
discarded. The beads are then resuspended into a slurry in
PBS-0.1% Tween 20, pelleted, and washed several additional
times. Finally, a 2% SDS solution is added to the sample,
which is then boiled at 100 °C for 3 minutes. Afterward, the
sample is centrifuged, and the supernatant loaded on a 10%
polyacrylamide SDS gel for electrophoresis. After the
proteins have been resolved, the gel is fixed, dried down,
and exposed to X-ray film for autoradiography or phosphor
plates for scanning by a Molecular Dynamics PhosphorImager.

The ability of Src SH3 to bind certain 35S-labeled
proteins is examined for competability with exogenous
peptides. Synthetic peptides corresponding to phage-
displayed inserts and motifs are added at the time that the
lysate is incubated with the Src-GST fusion protein
immobilized on glutathione-linked Sepharose beads. The SH3
binding peptides block binding of all or some of the labeled
proteins while negative control peptides (unrelated peptide
sequences) do not. The amount of competition is quantified

peptides.

Alternatively, NIH 3T3 cells were grown in Dulbecco's
Modified Eagle Medium (DME) + 10% fetal calf serum (FCS) + 80
µCi/mL Tran35S-label (ICN), washed with PBS, lysed in RIPA
buffer, and pelleted. Supernatant from 1.5 x 10^ cells was
precleared with 100 µg glutathione-agarose-immobilized GST.
The supernatant was then incubated with 10 µg glutathione-
agarose-immobilized GST-SH3 fusion protein with or without
added test peptide in a final volume of 250 µL. Pelleted
beads were washed with 1 mL each of RIPA, RIPA + 1%
deoxycholate + 0.1% SDS, and PBS, resuspended in 50 µL
SDS-PAGE sample buffer, boiled, and subjected to SDS-PAGE
(7.5%). Labeled proteins were detected by phosphorimaging
(Molecular Dynamics). The results are presented in Figure 9.

6.9. Peptide Competition of GST-SH3 Affinity Precipitations of PI-3' Kinase From Cell Lysates

It is possible to follow the precipitation of PI-3'
Kinase by Src from cell lysates in the presence or absence of
SH3-binding peptides. HeLa cells are lysed with detergent
and the protein mixtures are incubated for several hours with
the Src-GST fusion protein immobilized on glutathione-linked
Sepharose beads. After several hours of tumbling, the beads
are pelleted gently by low-speed centrifugation and the
supernatant is discarded. The beads are then resuspended
into a slurry in PBS-0.1% Tween 20, pelleted, and washed
several additional times. Finally, an SDS solution is added
to the sample, which is then boiled at 100 °C for 3 minutes.
Subsequently, the sample is centrifuged, and the supernatant
is loaded on a 10% polyacrylamide SDS gel for
electrophoresis. After the proteins have been resolved, the


gel is blotted to nitrocellulose or nylon (i.e., western
blot). The filter is then probed with a PI-3' Kinase
antibody (monoclonal and polyclonal antibodies are available
from Upstate Biotechnology Incorporated, Lake Placid, NY), and
an enzyme-linked secondary antibody. The amount of PI-3'
Kinase is then quantitated.

The ability of Src SH3 to bind PI-3' Kinase is examined
for competability with exogenous peptides. Synthetic
peptides corresponding to phage-displayed inserts and motifs
are added at the time that the lysate is incubated with the
Src-GST fusion protein that has been immobilized on
glutathione-linked Sepharose beads. Ten-fold and one
hundred-fold molar excess of peptides are used relative to
SH3 proteins. The SH3 binding peptides block binding of the
PI-3' Kinase as detected on western blots while negative
control peptides (unrelated peptide sequences) do not. The
amount of competition is quantified and correlated with the
amount of added SH3-domain binding peptides.
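
One way the competition described above might be quantified is as the percent loss of bound signal relative to a no-peptide control, evaluated at each molar excess of peptide. The sketch below is an illustration under that assumption only. The source does not specify a formula, and the function names and band intensities are hypothetical.

```python
def percent_competition(bound_with_peptide, bound_without_peptide):
    """Percent reduction in bound signal caused by the competitor peptide."""
    if bound_without_peptide <= 0:
        raise ValueError("the no-peptide signal must be positive")
    return 100.0 * (1.0 - bound_with_peptide / bound_without_peptide)


def peptide_nmol_for_excess(sh3_protein_nmol, fold_excess):
    """Amount of peptide giving the requested molar excess over SH3 protein."""
    return sh3_protein_nmol * fold_excess


# Hypothetical band intensities (arbitrary units), for illustration only.
no_peptide_signal = 1000.0
signals_by_fold_excess = {10: 400.0, 100: 50.0}

for fold, signal in signals_by_fold_excess.items():
    pct = percent_competition(signal, no_peptide_signal)
    print(f"{fold}-fold molar excess: {pct:.1f}% competition")
```

Plotting the resulting percentages against the fold excess gives the correlation between competition and amount of added peptide that the text refers to.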

6.10. In Vivo Association of SH3-Binding Peptides With SH3-Domains Of Proteins

To demonstrate association of the SH3-binding
peptides with SH3-domains of proteins inside cells, the SH3-
binding peptides are tagged and localized in cells. For
example, Bar-Sagi et al., in Cell (1993) 74:83-91, have shown
that SH3-binding proteins localize to the cytoskeleton when
expressed in cells. Thus, the SH3 domain-binding peptides of
the present invention can serve as cellular targeting
signals (e.g., to the cytoskeleton). Accordingly, the
peptides are tagged with biotin and, subsequently, injected
into cells. Alternatively, one can transfect into cells a
recombinant plasmid that expresses a fusion protein
comprising the SH3-binding peptide and the green
fluorescent protein (GFP, Chalfie et al., in Science (1994)
263:802-805). The location of the biotinylated peptide or
the GFP fusion protein is then assayed with FITC-labeled
streptavidin in paraformaldehyde-fixed cells or by direct
fluorescence in living cells, respectively. Localization of
the SH3-binding peptides to the cytoskeleton demonstrates
that the SH3-binding peptides can bind SH3-domain proteins in
vivo. In addition, focal adhesions, which are rich in Src,
are also sites of potential subcellular localization of SH3-
binding peptides.

Thus, NIH 3T3 fibroblasts were cultured in vitro on
glass coverslips coated with fibronectin. After two days of
growth at 37 °C, the cells were fixed for one hour at room
temperature in the presence of 2% paraformaldehyde (pH 7.5).
The coverslips were washed with PBS-0.1% Tween 20 several
times to remove the fixative. Next, the coverslips were
dipped into acetone (chilled at -20 °C) for approximately 20
seconds and allowed to air-dry. The coverslips were washed
again with PBS-0.1% Tween 20, containing BSA (1 mg/mL), and
incubated for 2 hours at room temperature with different
biotinylated peptides in PBS-0.1% Tween 20. The coverslips
were washed and then incubated with 1 mg/mL streptavidin-Cy3
(Jackson Immunoresearch Co., West Grove, PA) for 1 hour at
room temperature. Finally, the coverslips were washed in
PBS-0.1% Tween 20, mounted in a glycerol solution on a glass
slide, and viewed with a Nikon Optiphot epifluorescence
microscope and a 60x oil immersion lens.

The results are presented in Figure 11, in which panel A
displays cells stained with the conjugate biotin-spacer-
VLKRPLPIPPVTR (SEQ ID NO:64); panel B exhibits cells stained
with the conjugate biotin-spacer-GILAPPVPPRNTR (SEQ ID
NO:63); panel C shows cells stained with the long consensus
peptide, biotin-spacer-RSTPRPLPPLPTTR (SEQ ID NO:67); and
panel D shows cells stained with the proline-rich vinculin
peptide conjugate, biotin-spacer-LAPPKPPLPEGEV (SEQ ID
NO:70). The "spacer" sequence is KSGSG (SEQ ID NO:103). As
shown in Figure 11, the panels in which SH3 domain-binding
peptides were used present a bright display of fluorescence
activity that is in sharp contrast to the relatively "dark"
features of panel D (non-SH3 domain binding vinculin
segment). These results demonstrate further the ability of
the SH3 domain-binding peptides of the present invention to
localize to protein targets (e.g., Src and Src-related
proteins) within cells and provide an image thereof.

6.11. In Vivo Modulation Of Src In Oocytes With SH3-Binding Peptides

When Xenopus laevis oocytes are injected with
mRNA encoding deregulated Src, there are dramatic cytological
and biochemical changes in the oocyte (Unger, T.F. and
Steele, R.E., Mol. Cell. Biol. (1992) 12:5485-5498). The
applicants have obtained the plasmid for generating Src mRNA,
which is available from Dr. Robert Steele (University of
California at Irvine). Synthetic SH3-binding peptides are
injected into oocytes that have been previously injected with
Src mRNA. The state of the cytoskeleton is inspected
visually by observing the arrangement of cortical pigment
granules under a dissecting microscope. The state of
phosphorylation of several proteins is examined by western
blotting with an anti-phosphotyrosine monoclonal antibody
(4G10; Upstate Biotechnology Incorporated), as described in
Unger and Steele, above.

6.12. Progesterone-Induced X. laevis Oocyte Maturation

Segments of adult ovary were removed
surgically and incubated in 0.1% collagenase type D
(Boehringer Mannheim, Indianapolis, IN) in Ca2+-free OR2 (82.5
mM NaCl, 2.5 mM KCl, 1.0 mM MgCl2, 1.0 mM Na2HPO4, 5.0 mM
HEPES, and 3.8 mM NaOH, pH 7.6). Oocytes were then washed 3-
5 times with OR2 containing 1.0 mM CaCl2 and allowed to
recover in OR2 overnight at 18 °C. Stage VI oocytes were
injected with 40 nL of 100 mM peptide or water. After
injection, the oocytes were placed in OR2 with 2 mg/mL
progesterone (Sigma, St. Louis, MO) and incubated at 20 °C.
Oocytes were scored at hourly time points for germinal
vesicle breakdown (GVBD).

Figure 10 presents the results of this experiment. As
shown by the graph, oocytes injected with the SH3 domain-
binding peptide VLKRPLPIPPVTR (SEQ ID NO:64) exhibit a faster
rate of progesterone-induced germinal vesicle breakdown
relative to oocytes that had been injected with water or with
the proline-rich vinculin peptide, LAPPKPPLPEGEV (SEQ ID
NO:70). These results parallel those obtained by Unger and
Steele, supra, who observed a gross alteration in the cortex
of oocytes that had been injected with RNA encoding a
deregulated Src variant ("active" Src) versus those injected
with RNA encoding the wild-type Src ("cellular" Src). Also,
as shown in Figure 3B of the Unger and Steele article,
oocytes injected with deregulated or active Src RNA matured
at a faster rate than oocytes injected with water or wild-
type Src RNA.
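
Hourly GVBD scores of this kind are commonly reduced to a cumulative fraction of matured oocytes per time point, from which maturation rates of different injection groups can be compared. The sketch below shows one such reduction under stated assumptions: the counts are hypothetical, the first list entry is taken to be the 1-hour time point, and the 50% threshold is an illustrative choice rather than a criterion stated in the source.

```python
def gvbd_fractions(cumulative_counts, n_oocytes):
    """Fraction of oocytes that have undergone GVBD at each hourly score."""
    if n_oocytes <= 0:
        raise ValueError("n_oocytes must be positive")
    return [count / n_oocytes for count in cumulative_counts]


def first_hour_reaching(fractions, threshold=0.5):
    """First hourly time point (1-based) at or above the threshold, else None."""
    for hour, fraction in enumerate(fractions, start=1):
        if fraction >= threshold:
            return hour
    return None


# Hypothetical cumulative GVBD counts out of 10 oocytes per group.
groups = {
    "SH3-binding peptide": [0, 3, 7, 10, 10],
    "water": [0, 0, 2, 6, 9],
}

for label, counts in groups.items():
    fractions = gvbd_fractions(counts, n_oocytes=10)
    print(label, fractions, "50% reached at hour", first_hour_reaching(fractions))
```

A group that reaches the threshold at an earlier hour is maturing faster in the sense used in the text.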

The present results obtained with Src SH3 domain-binding
peptides suggest that these peptides modulate the biochemical
activity of "cellular" Src; in particular, it is proposed
that at least some of the Src SH3 domain-binding peptides of
the present invention upregulate the biochemical activity of
"cellular" Src, which may be downregulated or inhibited in
its normal state. Hence, the administration of the SH3
domain-binding peptides of the present invention can
constitute a novel method of modulating the activity of Src
or Src-related proteins. Specifically, certain of these
peptides are able to activate Src-family proteins.

6.13. In Vivo Antagonism Of Src In Src-Transformed Cells With SH3-Binding Peptides

The coding regions for SH3-binding peptides
are cloned into vectors that direct their expression in
animal cells. A bipartite gene is constructed, encoding a
protein with c-myc epitope and SH3-binding peptide, which is
transcribed from a strong constitutive promoter (e.g., SV40,
CMV, HSV TK, calmodulin). The vector is introduced into
either normal or Src-transformed cells via transfection
(e.g., electroporation, calcium phosphate, liposomes, DEAE
dextran). Transfected cells express the bipartite gene
transiently in culture. To create stable transformed cell
lines, the vector carries a selectable marker (e.g., neomycin
resistance) or transfection is performed in the presence of
excess plasmid carrying a selectable marker (e.g., neomycin
resistance) and cells selected for the marker. Transfected
cells are stained by immunofluorescence to detect expression
of the bipartite protein. The hybridoma 9E10 secretes a
monoclonal antibody that is highly specific for the c-myc
epitope (EQKLISEEDLN [SEQ ID NO:105]; see, Evan, G.A. et al.,
Mol. Cell. Biol. (1985) 5:3610-3616). This antibody is
used in immunofluorescence experiments to demonstrate that
the bipartite protein is expressed inside the cells, and in
some cases, localized to subcellular structures enriched in
SH3 domain bearing proteins.

There are several controls used in these experiments.
First, cells are transfected with vectors that do not have
the SH3-binding peptide coding region. Second, normal (non-
transformed) cells are transfected. Third, cells transformed
by oncogenes other than Src are used in the transfection
experiments. Fourth, cells are stained with other monoclonal
antibodies that do not recognize the c-myc epitope.

Transfected cells are examined for any changes in cell
shape, behavior, and metabolism as a consequence of
expressing the SH3 binding peptides. Cell shape is examined
by phase contrast microscope at several times after
transfection; in particular, the flatness of the cells, their
adhesion to the substrate, and the degree of cell ruffling
are monitored. Cell division rates, cell migration, and
contact inhibition are also observed over time. Finally, the
amount of phosphorylated tyrosine in transfected cells is
quantitated by phosphoaminoacid analysis and with an anti-
phosphotyrosine monoclonal antibody (4G10; Upstate
Biotechnology Incorporated) in western blotting experiments.

6.14. Preparation of PXXP Biased Peptide Libraries

The preparation and characterization of preferred phage-
displayed random peptide libraries have been described above
in Sections 6.1 - 6.4.

Using procedures similar to those described in these
sections, oligonucleotide inserts were constructed according
to the schematic provided in FIG. 12. The inserts were then
cloned into the mBAX vector, and the biased peptide libraries
were expressed as described previously.

The mBAX vector was created in the Kay Laboratory by
generating cloning sites in gene III of the M13mp18 vector
(Messing, J. (1991). Cloning in M13 phage or how to use
biology at its best. Gene 100, 3-12) in the manner of
Fowlkes et al. (1992) (Biotechniques 13, 422-427). The mBAX
vector displays a peptide sequence at the N-terminus of the
mature pIII protein that encodes the epitope for the mouse
monoclonal antibody 7E11 (see, FIG. 13); it includes the stop
codon TAG in the coding region, which is suppressed in E.
coli carrying suppressor tRNA gene mutations known as supE or
supF. There are no other stop codons in the mBAX genome.
The mBAX vector also carries a segment of the alpha fragment
of β-galactosidase; bacterial cells expressing the omega
fragment of β-galactosidase convert the clear XGal substrate
into an insoluble blue precipitate. The plaques appear blue.

Recombinant mBAX molecules can be distinguished easily
from non-recombinant molecules due to the TAG codon in the
Xho I - Xba I segment in gene III of mBAX. When recombinants
are generated by replacing the Xho I - Xba I fragment with
oligonucleotides encoding random peptides, the recombinants
can be grown in bacteria with (i.e., DH5αF') or without
(i.e., JS5) suppressor tRNA mutant genes. On the other hand,
the non-recombinant mBAX molecules fail to produce plaques on
bacterial lawns where the bacteria (i.e., JS5) lack such
suppressor genes. This is because in JS5, the TAG codon
serves as a stop codon to yield a truncated pIII molecule
during translation; since pIII is an essential protein
component of viable M13 viral particles, no plaques will
form.
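
The role of the TAG codon can be illustrated with a short sketch. It assumes the standard IUPAC nucleotide codes (N = any base, B = C, G or T, V = A, C or G) and the three stop codons of the standard genetic code. The NNB triplet is taken from the degenerate oligonucleotides in the sequence listing (for example, SEQ ID NO:19), and whether the biased libraries of FIG. 12 use the same scheme is an assumption. The function names are hypothetical, and `forms_plaques` merely restates the selection logic described above.

```python
from itertools import product

# Standard IUPAC nucleotide codes (only the ones needed here).
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "N": "ACGT", "B": "CGT", "V": "ACG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def expand_codon(degenerate_codon):
    """Return the set of concrete codons encoded by a degenerate triplet."""
    pools = [IUPAC[base] for base in degenerate_codon]
    return {"".join(bases) for bases in product(*pools)}


def forms_plaques(gene_iii_has_tag, host_has_suppressor):
    """Hypothetical rule: an unsuppressed TAG truncates pIII, so no plaques."""
    return (not gene_iii_has_tag) or host_has_suppressor


nnb_codons = expand_codon("NNB")
print(len(nnb_codons))                    # 48 codons
print(sorted(nnb_codons & STOP_CODONS))   # ['TAG'] is the only stop codon
print(forms_plaques(True, False))         # False: non-recombinant mBAX on JS5
print(forms_plaques(True, True))          # True: TAG suppressed by supE or supF
print(forms_plaques(False, False))        # True: recombinant with TAG replaced
```

Under these assumptions, an NNB-encoded insert can itself contain TAG by chance, in which case the clone would behave like the non-recombinant vector on a non-suppressing host.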

6.14.1 Discussion

The use of second generation or biased peptide
libraries, which fix all or part of the RPLPPLP consensus
motif for SH3 binding peptides and randomize flanking
residues, has defined additional sequence residues exhibiting
selective SH3 binding.

Tables 1-5, below, list the relevant amino acid
sequences obtained from the biased peptide library for each
set of SH3 domain binding peptides. The underscored amino
acid residues indicate the fixed positions. Also indicated
for each set of new binders is a "consensus" sequence, which
seeks to include the additional features gleaned from the new
binding peptides. The symbol "0" represents a hydrophobic
residue.
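
As an illustration of how a fixed core motif can be located within candidate peptides, the sketch below scans four peptides named in Section 6.10 for an R-X-L-P-X-X-P core. The regular expression is a hypothetical rendering of the core of SEQ ID NO:15 and of the RPLPPLP consensus, and it is not a binding predictor: the peptide of SEQ ID NO:63 is reported above as an SH3 domain-binding peptide yet does not contain this particular core.

```python
import re

# Hypothetical regular expression for the R-X-L-P-X-X-P core ("." = any residue).
CORE_MOTIF = re.compile(r"R.LP..P")

# Peptide sequences quoted in the description of Figure 11.
PEPTIDES = {
    "SEQ ID NO:64": "VLKRPLPIPPVTR",
    "SEQ ID NO:67": "RSTPRPLPPLPTTR",
    "SEQ ID NO:63": "GILAPPVPPRNTR",
    "SEQ ID NO:70": "LAPPKPPLPEGEV",
}


def find_core_motif(peptide):
    """Return (1-based start, matched residues) for the first core, or None."""
    match = CORE_MOTIF.search(peptide)
    if match is None:
        return None
    return match.start() + 1, match.group()


for seq_id, peptide in PEPTIDES.items():
    print(seq_id, peptide, find_core_motif(peptide))
```

The same approach extends to the consensus sequences of the tables by replacing the pattern, for instance with a character class wherever a hydrophobic residue is indicated.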



TABLE 1 CORTACTIN SH3-BINDING PEPTIDES

                                                  SEQ. ID NO.
PXXP.CORT.M1/2/3.PP     SSI<L6PPVEPI^TLFSFSR      107
PXXP.CORT.M4.PP         SRLGEFSK£PI£QKPTWMSR      108
PXXP.CORT.N2.PP         SRTERPPLEQR£DWLSYSSR      109
PXXP.CORT.N3.PP.INC     SREPDWLC£NC£LLIiRSDSR     110
PXXP.CORT.O1/2/3.PP     SSSSHNSR£PL£EKPSWLSR      111
PXXP.CORT.O4.PP         SRLTPQSI^PLEPKPSAVSR      112
CONSENSUS               KPP^PxKPxW                113
                        R


TABLE 2 NCK SH3-BINDING PEPTIDES

                                                  SEQ. ID NO.
PXXP.NCK.Q1/4.PP        £SL6VGWK£LP£MRTASI^       114
PXXP.NCK.Q2/3.PP.INC    £SV6FADR£RP£LRVESLSR      115
PXXP.NCK.R1.PP.INC      SSASILRPPSKPXRSrSLSR      116
PXXP.NCK.R2.PP          £SPYTGDVPIPPLRGASLSB      117
PXXP.NCK.R3.PP          gglUGSWPPVPPLRSDSLSR      118
PXXP.NCK.R4.PP          SSIGEDTPPSPPTRRASLSR      119
PXXP.NCK.S1/4.PP        £BSLSEVS£KP£IRSVSLSR      120
PXXP.NCK.S2.PP.INC      SSVSEGYSPPLPPR5TSLSB      121
PXXP.NCK.S3.PP          £SSFTLAAPTPPTRSLSL^       122
PXXP.NCK.T1.PP          S5PPYELP£RP£NRTVSL£B      123
PXXP.NCK.T2.PP          £BWDGLA£PP£VRLSSL>SE      124
PXXP.NCK.T3.PP.INC      £SI'GYSGA£VP£HRXSSL£B     125
PXXP.NCK.T4.PP          SSI SDYSRPPPPVRTIiSLSR    126
CONSENSUS               ^XXXX3{£XP£^RSXSL         127
                        T


TABLE 3 ABL SH3-BINDING PEPTIDES

                                                  SEQ. ID NO.
PXXP.ABL.G1/2.PP        SB6PRWSP£PV£LPTSLDSB      128
PXXP.ABL.G3/4.PP        SSPPDYAA£AI£SSLWVD£B      129
PXXP.ABL.H1/3/4.PP      SSPPHWAP£AP£AMSPPI£B      130
PXXP.ABL.H2.PP.INC      SSDRCWEC£PWEAGGQRGSR      131
PXXP.ABL.I1/2/3.PP      SSPPKFSP£PP£YWQLHA&R      132
PXXP.ABL.I4.PP          £SPPSFAP£AA£PRHSF6fiB     133
PXXP.ABL.J1.PP          fiSAPKKPA£PV£MMAHVHSB     134
PXXP.ABL.J2.PP.INC      SSPTYPPPEPPPTAKGASB       135
PXXP.ABL.J3.PP.INC      ££PPXXXP£PI£NSPQVLS£      136
PXXP.ABL.J4.PP          SfiPPTWTP£KP£GWGVVFfiB    137
PXXP.ABL.L1.PP          8SAPTHSP£AL£NVAKYKSS      138
PXXP.ABL.L2/3.PP        SSIKGPRFEVPEVPLNGVSB      139
PXXP.ABL.L4.PP          SSPPAWSP£HB£VAFGSTSB      140
CONSENSUS               PPxWxPPP^P                141


TABLE 4 PLCγ SH3-BINDING PEPTIDES

                                                  SEQ. ID NO.
PXXP.PLCG.P1.PP         SSMKVHNF£LP£LPSYET£B      142
PXXP.PLCG.P2.PP         SRVPPLVA£RP£STLNSLSB      143
PXXP.PLCG.P3.PP.INC     2SLYWQH6£DP£VGAPQL5£      144
PXXP.PLCG.P4.PP         SSHPLNSH£GG£FRHNLSSB      145






TABLE 5 SRC SH3-BINDING PEPTIDES

                                                     SEQ. ID NO.
PXXP.SRC.A1.PP          fiSRALRVR£LP£VP6TSL^         146
PXXP.SRC.A2.PP          aSPRAI«PL£PT£DNPFAGSB        147
PXXP.SRC.A3.PP          SRPAr^Sijgr^Kf LtPPVATI SR   148
PXXP.SRC.A4.PP          figX SQRALPPIiPIiMSDPASR     149
PXXP.SRC.B1.PP          fiSPA xRPjjERLfDIiS V X YSR  150
PXXP.SRC.B2/3.PP        fiSr XNRRLtPAXiPPPNSXiLSR    151
PXXP.SRC.B4.PP          SRIiTGRPLPALPPPFSDFSR        152
PXXP.SRC.C1.PP          SHMKPRVIiPPIPTvESAVSR        153
PXXP.SRC.C2.PP.INC      SSLYSAIAPDPPPRNSSSSR         154
PXXP.SRC.C3.PP          SSLASRPLPLLPNSAPGOSR         155
PXXP.SRC.D1.PP          SSLTSRPLeDI£VRPSKS£B         156
PXXP.SRC.D2.PP.INC      SSLKHRAIfPIfETDTPYSB         157
PXXP.SRC.D3.PP          SSNTNRLP£PT£D6LDVR^          158
PXXP.SRC.D4.PP          SSLQSRPLPLPPQSS YPI SR       159
CONSENSUS               RPLPPLP                      160



It should be apparent to one of ordinary skill that many 
other embodiments of the present invention can be 
contemplated beyond the preferred embodiments described above 
but which other embodiments nevertheless fall within the 
scope and spirit of the present invention. Hence, the 
present invention should not be construed to be limited to 
the preferred embodiments described herein, which serve only 
to illustrate the present invention, but only by the claims 
that follow. 

Also, numerous references are cited throughout the 
specification. The complete disclosures of these references 
are incorporated by reference herein. 






SEQUENCE LISTING

(1) GENERAL INFORMATION:

    (i) APPLICANT: SPARKS, ANDREW B.
                   KAY, BRIAN K.
                   THORN, JUDITH M.
                   QUILLIAM, LAWRENCE A.
                   DER, CHANNING J.

    (iii) NUMBER OF SEQUENCES: 106

(2) INFORMATION FOR SEQ ID NO:1:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 10 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: unknown

    (ii) MOLECULE TYPE: peptide

    (ix) FEATURE:
        (D) OTHER INFORMATION: /note= "hydrophobic residue (such

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

        Xaa Pro Xaa Xaa Pro Pro Pro Xaa Xaa Pro
        1               5                   10




(2) INFORMATION FOR SEQ ID NO:2:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 9 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: unknown

    (ii) MOLECULE TYPE: peptide

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION:
        (D) OTHER INFORMATION: /note= "Xaa = any residue other
            than Cys"

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 6
        (D) OTHER INFORMATION: /note= "X = any residue other
            than Cys"

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 8..9
        (D) OTHER INFORMATION: /note= "X = any residue other
            than Cys"

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

        Xaa Xaa Xaa Pro Pro Xaa Pro Xaa Xaa
        1               5

(2) INFORMATION FOR SEQ ID NO:3:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 9 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: unknown

    (ii) MOLECULE TYPE: peptide

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 2
        (D) OTHER INFORMATION: /note= "X tends to be hydrophobic"

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 8..9
        (D) OTHER INFORMATION: /note= "X tends to be hydrophobic"

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

        Arg Xaa Leu Pro Pro Arg Pro Xaa Xaa
        1               5

(2) INFORMATION FOR SEQ ID NO:4:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 9 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: unknown

    (ii) MOLECULE TYPE: peptide

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

        Arg Lys Leu Pro Pro Arg Pro Arg Arg
        1               5

(2) INFORMATION FOR SEQ ID NO:5:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 10 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: unknown

    (ii) MOLECULE TYPE: peptide

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

        Val Pro Pro Pro Val Pro Pro Arg Arg Arg
        1               5                   10

(2) INFORMATION FOR SEQ ID NO:6:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 7 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: unknown

    (ii) MOLECULE TYPE: peptide

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 2
        (D) OTHER INFORMATION: /note= "tends to be Pro"

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 3
        (D) OTHER INFORMATION: /note= "hydrophobic residue"

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 5
        (D) OTHER INFORMATION: /note= "tends to be Pro"

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

        Xaa Xaa Xaa Pro Xaa Xaa Pro
        1               5

(2) INFORMATION FOR SEQ ID NO:7:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 9 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: unknown

    (ii) MOLECULE TYPE: peptide

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 2
        (D) OTHER INFORMATION: /note= "X = any residue other
            than Cys"

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 8..9
        (D) OTHER INFORMATION: /note= "X = any residue other
            than Cys"

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

        Arg Xaa Leu Pro Pro Arg Pro Xaa Xaa
        1               5

(2) INFORMATION FOR SEQ ID NO:8:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 9 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: unknown

    (ii) MOLECULE TYPE: peptide

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 9
        (D) OTHER INFORMATION: /note= "hydrophobic residue"

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

        Arg Xaa Leu Pro Pro Leu Pro Arg Phe
        1               5

(2) INFORMATION FOR SEQ ID NO:9:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 7 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: unknown

    (ii) MOLECULE TYPE: peptide

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:

        Arg Pro Leu Pro Pro Leu Pro
        1               5

(2) INFORMATION FOR SEQ ID NO:10:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 9 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: unknown

    (ii) MOLECULE TYPE: peptide

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 2
        (D) OTHER INFORMATION: /note= "any residue other than Cys"

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 5..6
        (D) OTHER INFORMATION: /note= "any hydrophobic residue"

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 9
        (D) OTHER INFORMATION: /note= "any hydrophilic residue
            except Cys"

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:

        Arg Xaa Leu Pro Xaa Xaa Pro Xaa Xaa
        1               5

(2) INFORMATION FOR SEQ ID NO:11:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 9 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: unknown

    (ii) MOLECULE TYPE: peptide

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:

        Arg Pro Leu Pro Pro Leu Pro Thr Ser
        1               5



(2) INFORMATION FOR SEQ ID NO:12:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 4 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (v) FRAGMENT TYPE:

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:

        Xaa Xaa Xaa Xaa
        1

(2) INFORMATION FOR SEQ ID NO:13:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 5 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (v) FRAGMENT TYPE:

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:

        Xaa Xaa Xaa Xaa Xaa
        1               5

(2) INFORMATION FOR SEQ ID NO:14:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 4 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (v) FRAGMENT TYPE:

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:

        Xaa Xaa Xaa Xaa
        1

(2) INFORMATION FOR SEQ ID NO:15:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 13 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: unknown

    (ii) MOLECULE TYPE: peptide

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 1..3
        (D) OTHER INFORMATION: /note= "any residue other than Cys"

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 5
        (D) OTHER INFORMATION: /note= "any residue other than Cys"

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION:
        (D) OTHER INFORMATION: /note= "any hydrophobic residue"

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 11..13
        (D) OTHER INFORMATION: /note= "any residue other than Cys"

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:

        Xaa Xaa Xaa Arg Xaa Leu Pro Xaa Xaa Pro Xaa Xaa Xaa
        1               5                   10

(2) INFORMATION FOR SEQ ID NO:16:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 42 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:

        Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
        1 5 10 15

        Xaa Xaa Xaa Xaa Pro Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
        20 25 30

        Xaa Ser Arg
        35 40

(2) INFORMATION FOR SEQ ID NO:17:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 69 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: both
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:

GOCTOGAONN NBNNBimBltN BNNBNNBNNB NNBNNBNNBN HBNNBNNBNN BNNBNNBNNB 60
HHBCCAOOT

(2) INFORMATION FOR SEQ ID NO:18:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 68 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: both
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:

GGTCCAHNVN HVHHVWHVNN VNNVNWVNHV NHVHNVNHVN NVWNVNHVNN VNHVWNVHWV 60
AOATCTOO 68

(2) INFORMATION FOR SEQ ID NO:19:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 66 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: double
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:

TCGACNNNBN NBNNBNNBNN BNNBNNBNNB NNBNNBNNBN NBNNBNNBNN BNNBNNBNNB 60
CCAGOT

(2) INFORMATION FOR SEQ ID NO:20:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 65 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: double
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:

OOTCCANNVN NVNNVNNVNN VNNVMMVmiV NNVNNVNNVN NVNNVNNVNN VNNVNNVNNV 60
AGATC 65



(2) INFORMATION FOR SEQ ID NO:21:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 46 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: protein

    (v) FRAGMENT TYPE:

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 2
        (D) OTHER INFORMATION: /note= "Arg or Ser"

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:

        Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
        1               5                   10                  15

        Xaa Xaa Xaa Xaa Pro Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
                    20                  25                  30

        Xaa Xaa Xaa Xaa Xaa Xaa Ser Arg Pro Ser Arg Thr
        35 40 45

(2) INFORMATION FOR SEQ ID NO:22:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 49 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: unknown

    (ii) MOLECULE TYPE: protein

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:

        Ser His Ser Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
        1               5                   10                  15

        Xaa Xaa Xaa Xaa Xaa Pro Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa
        20 25 30

        Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Arg Pro Ser Arg
        35 40 45

        Thr

(2) INFORMATION FOR SEQ ID NO:23:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 27 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 2
        (D) OTHER INFORMATION: /note= "Ser or Thr"

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 13
        (D) OTHER INFORMATION: /note= "Ser, Arg, Gly, Cys or Trp"

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 15
        (D) OTHER INFORMATION: /note= "Val, Ala, Asp, Glu or Gly"

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:

        Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa
        1               5                   10                  15

        Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Arg
                    20                  25

(2) INFORMATION FOR SEQ ID NO:24:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 48 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: both
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:

TTTT6TCGAC NNNBNNBNNB NNBNNBNHBN NBNNBNNBNN BNGCGGTG 48

(2) INFORMATION FOR SEQ ID NO:25:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 48 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: both
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:

COCCACNVNN VNNVKNVNNV NMVNNVNNVN NVNNVNNVTC ATCATTTT 48

(2) INFORMATION FOR SEQ ID NO:26:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 43 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: double
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:

CGCCACNVNN VNNVKNVNNV NNVNNVNNVN NVNNVNNVTG ATC 43

(2) INFORMATION FOR SEQ ID NO:27:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 43 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: double
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:

TOOACNNNBN NBNNBNNBNN BNNBNNBNNB NNBNNBNGCO OTO 43

(2) INFORMATION FOR SEQ ID NO:28:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(v) FRAGMENT TYPE:

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 2
(D) OTHER INFORMATION: /note= "Ser or Thr"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 13
(D) OTHER INFORMATION: /note= "Ser, Arg, Gly, Cys or Trp"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 15
(D) OTHER INFORMATION: /note= "Val, Ala, Asp, Glu or Gly"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa
1 5 10 15

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Arg Pro Ser Arg Thr
20 25 30

(2) INFORMATION FOR SEQ ID NO:29:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 5
(D) OTHER INFORMATION: /note= "Ser or Thr"

(ix) FEATURE:
(A) NAME/KEY: Modified-site

(B) LOCATION: 16
(D) OTHER INFORMATION: /note= "Ser, Arg, Gly, Cys or Trp"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 18
(D) OTHER INFORMATION: /note= "Val, Ala, Asp, Glu or Gly"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:

Ser His Ser Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
1 5 10 15

Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Arg Pro Ser
20 25 30

Arg Thr



(2) INFORMATION FOR SEQ ID NO:30:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 8 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
1 5



(2) INFORMATION FOR SEQ ID NO:31:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 58 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:

TGACGTCTCG AGTTGTNNKN NKNNKNNKNN KNNKNNKNNK TGTGGATCTA GAAGGATC

(2) INFORMATION FOR SEQ ID NO:32:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:

CCTAGATCTT CCTAG 15

(2) INFORMATION FOR SEQ ID NO:33:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:

TCGAGTTGTN NKNNKNNKNN KNNKNNKNNK NNKTGTGGAT 40

(2) INFORMATION FOR SEQ ID NO:34:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:

CAACANNKNN KNNMNNMNNM NNMNNMNNMA CACCTAGATC 40 

(2) INFORMATION FOR SEQ ID NO:35:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(v) FRAGMENT TYPE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35:

Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Gly Ser Arg Pro
1 5 10 15

Ser Arg Thr

(2) INFORMATION FOR SEQ ID NO:36:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 28 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36:

Ser Ser Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Gly Ser Arg Ser
1 5 10 15

Thr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Thr Arg
20 25

(2) INFORMATION FOR SEQ ID NO:37:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 80 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37:

TCGAGTTGTN NKNNKNNKNN KNNKNNKNNK NNKTGTGGAT CTAGATCCAC AVNNVNNVNN 60
VNNVNNVNNV NNVNNACAAC 80

(2) INFORMATION FOR SEQ ID NO:38:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 45 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38:

Ser Ser Phe Asp Gln Gln Asp Trp Asp Tyr Ser Ile Ala Glu Lys Met
1 5 10 15

His Pro Ile Arg Pro Gly Phe Arg Glu Leu Pro Pro Leu Pro Pro Ser
20 25 30

Arg Ala Ser Phe Gly Gly Gly Ala Ser Arg Pro Ser Arg
35 40 45

(2) INFORMATION FOR SEQ ID NO:39:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39:

Ser Thr Asn Val Trp Val Thr Gly Ser Val Ile Ala Arg Gly Ala Gln
1 5 10 15

Ser Arg Pro Leu Pro Ile Pro Pro Glu Thr Arg Pro Ser Arg
20 25 30

(2) INFORMATION FOR SEQ ID NO:40:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40:

Ser Thr Ala Pro Trp Gly Leu Arg Val Ala His Glu Gly Gly Val Leu
1 5 10 15

Lys Arg Pro Leu Pro Ile Pro Pro Val Thr Arg Pro Ser Arg
20 25 30

(2) INFORMATION FOR SEQ ID NO:41:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 45 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41:

Ser Ser Ser Gly Tyr Val Val Pro Lys Arg Leu Gly Asp Met Arg Glu
1 5 10 15

Tyr Asn Ala His Pro Gly Leu His Val Pro Pro Asn Ser Pro Leu Pro
20 25 30

Pro Leu Pro Thr His Leu Gln Ser Ser Arg Pro Ser Arg
35 40 45

(2) INFORMATION FOR SEQ ID NO:42:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 44 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42:

Ser Ser Arg Gly Glu Gly Asn Asn Ile Ile Ser Ser Arg Pro Phe Leu
1 5 10 15

Ser Asn Ser Asp Pro Gly Val Ser Asn Lys Leu Thr Gly Arg Gly Pro
20 25 30

Leu Pro Pro Leu Pro Asn Asp Ser Arg Pro Ser Arg
35 40

(2) INFORMATION FOR SEQ ID NO:43:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43:

Ser Thr Ala Val Ser Phe Arg Phe Met Pro Gly Gly Gly Gly Ala Phe
1 5 10 15

Tyr Ser Thr Arg Pro Val Pro Pro Ile Thr Arg Pro Ser Arg Thr
20 25 30



(2) INFORMATION FOR SEQ ID NO:44:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44:

Ser Thr Ala His Ser Leu Trp Asp Trp Gly Thr Phe Ser Gly Val Ser
1 5 10 15

His Lys Ser Arg Leu Pro Pro Leu Pro Thr Arg Pro Ser Arg Thr
20 25 30

(2) INFORMATION FOR SEQ ID NO:45:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45:

Pro Gly Tyr Ala Arg Ile Val Ser Tyr Arg Phe Arg Ala Leu Pro Ser
1 5 10 15

Pro Pro Ser Ala Ser Arg Pro Ser Arg
20 25

(2) INFORMATION FOR SEQ ID NO:46:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46:

Ser Thr Asn Asp Val Asp Trp Met His Met Trp Asn Ser Gly Gly Pro
1 5 10 15

His Arg Arg Leu Pro Pro Thr Pro Ala Thr Arg Pro Ser Arg
20 25 30

(2) INFORMATION FOR SEQ ID NO:47:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 45 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47:

Ser Ser Asp Asn Trp Ala Arg Arg Val His Ala Ser Gly Leu ti* Tyr
1 5 10 15

Thr Asp Leu Ser Pro Gly Ile Leu Leu Ala Gln Arg Gln Leu Pro Pro
20 25 30

Thr Pro Gly Arg Asp Pro Ser His Ser Arg Pro Ser Arg
35 40 45

(2) INFORMATION FOR SEQ ID NO:48:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 45 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48:

Ser Ser Glu Ser Pro Leu Met Tyr Asn Arg Val Gly Ala Leu Gln Ser
1 5 10 15

Leu Thr Ser Val Pro Gly Ser Met Met His Phe Ala Leu Gln Arg Arg
20 25 30

Leu Pro Arg Thr Pro Pro Pro Ala Ser Arg Pro Ser Arg
35 40 45

(2) INFORMATION FOR SEQ ID NO:49:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49:

Ser Thr Arg Trp Ser His Ser Trp Pro Gly Tyr Val Gly Gly Ala Asn
1 5 10 15

Pro Ser Pro Ala Thr Arg Pro Leu Pro Thr Arg Pro Ser Arg Thr Val
20 25 30

Glu Ser Cys
35

(2) INFORMATION FOR SEQ ID NO:50:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 50 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50:

Ser Arg Tyr Asn Asp Leu Gly Thr Arg Pro Val Ser Glu Val Ile Lys
1 5 10 15

Tyr Asp Tyr Phe Pro Gly Tyr Ser Gln His Val Ile Thr Pro Asp Gly
20 25 30

Ser Tyr Ser Thr Arg Pro Leu Pro Ser Arg Pro Ser Arg Thr Val Glu
35 40 45

Ser Cys
50



(2) INFORMATION FOR SEQ ID NO:51:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51:

Pro Gly Arg Leu Leu Leu Pro Ser Glu Pro Arg Thr Phe Tyr Asn Tyr
1 5 10 15

Gly His Asp Ser Arg Pro Ser Arg
20

(2) INFORMATION FOR SEQ ID NO:52:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52:

Ser Thr Met Tyr Gly Val Ser Trp Leu Ser Ser Gly Ser Gly Gly Ile
1 5 10 15

Leu Ala Pro Pro Val Pro Pro Arg Asn Thr Arg Pro Ser Arg
20 25 30

(2) INFORMATION FOR SEQ ID NO:53:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53:

Ser Ser Cys Thr Glu Lys Thr Val Ser Gly Trp Cys Gly Ser Arg Ser
1 5 10 15

Thr Arg Pro Leu Pro Ile Leu Pro Arg Thr Thr Arg Pro Ser Arg
20 25 30

(2) INFORMATION FOR SEQ ID NO:54:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54:

Ser Ser Cys Met Leu Pro Thr Asp Gly Trp Gln Cys Gly Ser Arg Ser
1 5 10 15

Thr Pro Arg Pro Leu Pro Met Leu Pro Thr Thr Arg Pro Ser Arg
20 25 30

(2) INFORMATION FOR SEQ ID NO:55:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55:

Ser Ser Cys Asp Gly Thr Gln Phe Arg Leu Asn Cys Gly Ser Arg Ser
1 5 10 15

Thr Asn Arg Pro Leu Pro Met Ile Pro Thr Thr Arg Pro Ser Arg
20 25 30

(2) INFORMATION FOR SEQ ID NO:56:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56:

Ser Ser Cys Met Gln Gly Gln Ala Gly Leu Lys Cys Gly Ser Arg Ser
1 5 10 15

Thr Arg Pro Leu Pro Ser Leu Pro Ile Thr Thr Arg Pro Ser Arg
20 25 30

(2) INFORMATION FOR SEQ ID NO:57:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57:

Ser Ser Cys Tyr Arg Glu Lys Asp Thr Trp Gly Cys Gly Ser Arg Ser
1 5 10 15

Thr Ser Arg Pro Leu Pro Ser Leu Pro Thr Thr Arg Pro Ser Arg
20 25 30

(2) INFORMATION FOR SEQ ID NO:58:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58:

Ser Ser Cys Leu Phe Glu Gln Gly Ala Gly Thr Cys Gly Ser Arg Ser
1 5 10 15

Thr Arg Ser Leu Pro Pro Leu Pro Pro Thr Thr Arg Pro Ser Ser Arg
20 25 30

(2) INFORMATION FOR SEQ ID NO:59:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 44 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59:

Ser Ser Cys Asp Thr Gly Arg Ile Ala Pro Gly Cys Gly Ser Arg Ser
1 5 10 15

Thr Pro Arg Pro Leu Pro Leu Ile Pro Thr Thr Pro Arg Ser Thr Asn
20 25 30

Leu Asn Leu Thr Ser Thr Thr Thr Arg Pro Ser Arg
35 40



(2) INFORMATION FOR SEQ ID NO:60:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60:

Ser Ser Cys Gly Leu Asp Asn Ala Ala Lys Thr Cys Gly Ser Arg Ser
1 5 10 15

Thr Arg Pro Leu Pro Pro Thr Pro Leu Thr Thr Arg Pro Ser Arg
20 25 30

(2) INFORMATION FOR SEQ ID NO:61:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61:

Ser Ser Cys Ser Arg Ala His Glu Thr Glu Met Cys Gly Ser Arg Ser
1 5 10 15

Thr Arg Pro Gln Pro Pro Pro Pro Ile Thr Thr Arg Pro Ser Arg
20 25 30

(2) INFORMATION FOR SEQ ID NO:62:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62:

Arg Ser Thr Pro Arg Pro Leu Pro Met Leu Pro Thr Thr Arg
1 5 10
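
The listing writes peptides in three-letter code, whereas the claims cite the same peptides in one-letter code. As a minimal sketch for cross-checking the two, assuming the standard three-letter abbreviations plus Xaa for an unspecified residue, the hypothetical helper `to_one_letter` below converts a listing line to the one-letter form. It is an illustration only and not part of the original disclosure.

```python
# Standard three-letter to one-letter amino acid codes; Xaa marks an unspecified residue.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",
}


def to_one_letter(three_letter_sequence):
    """Convert a whitespace-separated three-letter sequence to one-letter code.

    Raises KeyError on any token that is not a recognised residue code, which
    makes it easy to spot a mis-transcribed residue.
    """
    return "".join(THREE_TO_ONE[token] for token in three_letter_sequence.split())


if __name__ == "__main__":
    # The 14-residue peptide of SEQ ID NO:62.
    print(to_one_letter("Arg Ser Thr Pro Arg Pro Leu Pro Met Leu Pro Thr Thr Arg"))
```

Because unknown tokens raise an error rather than being skipped, the helper doubles as a quick sanity check on a transcribed sequence line.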

(2) INFORMATION FOR SEQ ID NO:63:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63:

Gly Ile Leu Ala Pro Pro Val Pro Pro Arg Asn Thr Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO:64:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64:

Val Leu Lys Arg Pro Leu Pro Ile Pro Pro Val Thr Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO:65:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65:

Gly Pro His Arg Arg Leu Pro Pro Thr Pro Ala Thr Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO:66:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66:

Ala Asn Pro Ser Pro Ala Thr Arg Pro Leu Pro Thr Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO:67:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67:

Arg Ser Thr Pro Arg Pro Leu Pro Pro Leu Pro Thr Thr Arg
1 5 10



(2) INFORMATION FOR SEQ ID NO:68:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68:

Gly Thr Val Glu Pro Val Pro Pro Pro Val Pro Pro Arg Arg Arg Pro
1 5 10 15

Glu Ser Ala

(2) INFORMATION FOR SEQ ID NO:69:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69:

Lys Ile Ser Pro Pro Thr Pro Lys Pro Arg Pro Pro Arg Pro Leu Pro
1 5 10 15

Val

(2) INFORMATION FOR SEQ ID NO:70:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70:

Ala Pro Pro Lys Pro Pro Leu Pro Glu Gly Glu Val
1 5 10

(2) INFORMATION FOR SEQ ID NO:71:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 2
(D) OTHER INFORMATION: /note= "any residue other than Cys"

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 5..6
(D) OTHER INFORMATION: /note= "any hydrophobic residue"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:71:

Arg Xaa Leu Pro Xaa Xaa Pro
1 5
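
The seven-residue consensus of SEQ ID NO:71 (Arg, any residue other than Cys, Leu, Pro, two hydrophobic residues, Pro) can be expressed as a pattern match. The sketch below is an illustration only: the set of residues treated as "hydrophobic" is an assumption made here, since the listing does not enumerate one, and the names `MOTIF` and `find_motif` are hypothetical.

```python
import re

# The 19 standard residues other than cysteine (position 2 of the motif).
NON_CYS = "ADEFGHIKLMNPQRSTVWY"

# Assumption: one possible definition of "hydrophobic" for positions 5 and 6.
HYDROPHOBIC = "AFILMPVW"

# R - (non-Cys) - L - P - (hydrophobic) - (hydrophobic) - P
MOTIF = re.compile(f"R[{NON_CYS}]LP[{HYDROPHOBIC}]{{2}}P")


def find_motif(peptide):
    """Return the 0-based index of the first motif occurrence in a one-letter
    peptide sequence, or None if the motif is absent."""
    match = MOTIF.search(peptide)
    return match.start() if match else None


if __name__ == "__main__":
    for peptide in ("RPLPPLP", "RSTPRPLPMLPTTR", "RCLPPLP"):
        print(peptide, find_motif(peptide))
```

Changing `HYDROPHOBIC` is the only adjustment needed to test a stricter or looser reading of the motif.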






(2) INFORMATION FOR SEQ ID NO:72:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72:

Pro Gly Phe Arg Glu Leu Pro Pro Leu Pro Pro Ser Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO:73:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73:

Ala Gln Ser Arg Pro Leu Pro Ile Pro Pro Glu Thr Arg
1 5 10



(2) INFORMATION FOR SEQ ID NO:74:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74:

Pro Pro Asn Ser Pro Leu Pro Pro Leu Pro Thr His Leu
1 5 10



(2) INFORMATION FOR SEQ ID NO:75:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75:

Thr Gly Arg Gly Pro Leu Pro Pro Leu Pro Asn Asp Ser
1 5 10



(2) INFORMATION FOR SEQ ID NO:76:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76:

Tyr Ser Thr Arg Pro Val Pro Pro Ile Thr Arg Pro Ser
1 5 10

(2) INFORMATION FOR SEQ ID NO:77:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77:

Ser His Lys Ser Arg Leu Pro Pro Leu Pro Thr Arg Pro
1 5 10

(2) INFORMATION FOR SEQ ID NO:78:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78:

Tyr Arg Phe Arg Ala Leu Pro Ser Pro Pro Ser Ala Ser
1 5 10



(2) INFORMATION FOR SEQ ID NO:79:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:79:

Leu Ala Gln Arg Gln Leu Pro Pro Thr Pro Gly Arg Asp
1 5 10



(2) INFORMATION FOR SEQ ID NO:80:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:80:

Ala Leu Gln Arg Arg Leu Pro Arg Thr Pro Pro Pro Ala
1 5 10

(2) INFORMATION FOR SEQ ID NO:81:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:81:

Pro Ala Thr Arg Pro Leu Pro Thr Arg Pro Ser Arg Thr
1 5 10



(2) INFORMATION FOR SEQ ID NO:82:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82:

Tyr Ser Thr Arg Pro Leu Pro Ser Arg Pro Ser Arg Thr
1 5 10



(2) INFORMATION FOR SEQ ID NO:83:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:83:

Xaa Pro Gly Arg Ile Leu Leu Leu Pro Ser Glu Pro Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO:84:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84:

Ser Gly Gly Ile Leu Ala Pro Pro Val Pro Pro Arg Asn
1 5 10

(2) INFORMATION FOR SEQ ID NO:85:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85:

Arg Ser Thr Arg Pro Leu Pro Ile Leu Pro Arg Thr Thr
1 5 10

(2) INFORMATION FOR SEQ ID NO:86:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:86:

Ser Thr Pro Arg Pro Leu Pro Met Leu Pro Thr Thr Arg
1 5 10



(2) INFORMATION FOR SEQ ID NO:87:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:87:

Ser Thr Asn Arg Pro Leu Pro Met Ile Pro Thr Thr Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO:88:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88:

Arg Ser Thr Arg Pro Leu Pro Ser Leu Pro Ile Thr Thr
1 5 10

(2) INFORMATION FOR SEQ ID NO:89:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:89:

Ser Thr Ser Arg Pro Leu Pro Ser Leu Pro Thr Thr Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO:90:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:90:

Arg Ser Thr Arg Pro Leu Pro Ser Leu Pro Ile Thr Thr
1 5 10



(2) INFORMATION FOR SEQ ID NO:91:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:91:

20 ^9 ^9 '^^^ 

15 10 



(2) INFORMATION FOR SEQ ID NO:92:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:92:

Ser Thr Pro Arg Pro Leu Pro Leu Ile Pro Thr Thr Pro
1 5 10



(2) INFORMATION FOR SEQ ID NO:93:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93:

Arg Ser Thr Arg Pro Leu Pro Pro Thr Pro Leu Thr Thr
1 5 10

(2) INFORMATION FOR SEQ ID NO:94:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:94:

Arg Ser Thr Arg Pro Gln Pro Pro Pro Pro Ile Thr Thr
1 5 10



(2) INFORMATION FOR SEQ ID NO:95:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:95:

Arg Pro Leu Pro Met Leu Pro
1 5



(2) INFORMATION FOR SEQ ID NO:96:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 5
(D) OTHER INFORMATION: /note= "Pro or Met"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:96:

Arg Pro Leu Pro Xaa Leu Pro
1 5

(2) INFORMATION FOR SEQ ID NO:97:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:97:

Arg Ser Thr Pro
1



(2) INFORMATION FOR SEQ ID NO:98:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:98:

Arg Ser Thr Pro Ala Pro Pro Val Pro Pro Arg Thr Thr Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO:99:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:99:
A6CGTAAC6A TCTCCOG 17 

(2) INFORMATION FOR SEQ ID NO:100:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:100:
TTCACCTCOA AAGCAAGCTG 20 

(2) INFORMATION FOR SEQ ID NO:101:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:101:

CCTCATAGTT AGCGTAACG 19

(2) INFORMATION FOR SEQ ID NO:102:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:102:

AGCGTAACGA TCTAAA 16



(2) INFORMATION FOR SEQ ID NO:103:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 5 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:103:

Lys Ser Gly Ser Gly
1 5



(2) INFORMATION FOR SEQ ID NO:104:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104:

Gly Ser Gly Ser
1

(2) INFORMATION FOR SEQ ID NO:105:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 11 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:105:

Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Asn
1 5 10



(2) INFORMATION FOR SEQ ID NO:106:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:106:

Ser Ser Cys Asp His Thr Leu Gly Leu Gly Trp Cys Gly Ser Arg Ser
1 5 10 15

Thr Arg Gln Leu Pro Ile Pro Pro Thr Thr Thr Arg Pro Ser Arg
20 25 30

WHAT IS CLAIMED IS:

1. A peptide having at least nine and up to forty-five
amino acid residues, including an amino acid sequence of the
formula, R-2-L-P-5-6-P-8-9 (SEQ ID NO:10), positioned
anywhere along the peptide, in which each number represents
an amino acid residue, such that 2 represents any amino acid
residue except cysteine, 5 and 6 each represents a
hydrophobic amino acid residue, 8 represents any amino acid
residue except cysteine, and 9 represents a hydrophilic amino
acid residue except cysteine, each letter being the standard
one-letter symbol for the corresponding amino acid, said
peptide exhibiting a binding affinity for the SH3 domain of
Src, provided that said peptide is not R-P-L-P-P-L-P-T-S (SEQ
ID NO:11).

2. The peptide of claim 1 in which 2 is a P, R, A, L,
Q, E or S.

3. The peptide of claim 1 in which 5 is a P, M, I or

4. The peptide of claim 1 in which 6 is a P, L, I or
V.

5. The peptide of claim 1 in which 8 is a T, R, P, I,
N, E, V, S, A, G or L.

6. The peptide of claim 1 in which 9 is a T, R, S, H
or D.

7. The peptide of claim 1 which further comprises a
C-terminal-flanking amino acid sequence of the formula 10,
10-11, 10-11-12, 10-11-12-13 (SEQ ID NO:12) or 10-11-12-13-14
(SEQ ID NO:13), in which each number represents any amino
acid residue except cysteine, such that 10 is bound to 9 by a
peptide bond.

8. The peptide of claim 7 in which 10 is T, R, L, S,
D, P, A or N.

9. The peptide of claim 7 in which 11 is R, P, A, Q, S
or T.

10. The peptide of claim 7 in which 12 is P, S, R or T.

11. The peptide of claim 7 in which 13 is P, S, R, F, H
or T.

12. The peptide of claim 7 in which 14 is S, R, G or T.

13. The peptide of claim 1 which further comprises an
N-terminal-flanking amino acid sequence of the formula 1',
2'-1', 3'-2'-1' or 4'-3'-2'-1' (SEQ ID NO:14) in which each
number represents any amino acid residue except cysteine,
such that 1' is bound to R by a peptide bond.

14. The peptide of claim 13 in which 1' is T, P, S, N,
F, W, K, Q or G.

15. The peptide of claim 13 in which 2' is S, T, G,
R, A or H.

16. The peptide of claim 13 in which 3' is R, S, P, G,
A, V, Y or L.

17. The peptide of claim 13 in which 4' is R, S, V, T,
G, L or F.

18. A peptide having at least thirteen and up to forty-five
amino acid residues, including an amino acid sequence of
the formula, 3'-2'-1'-R-2-L-P-5-6-P-8-9-10 (SEQ ID NO:15),
positioned anywhere along the peptide, in which each number
represents an amino acid residue, such that 3', 2', 1', 2, 8,
and 10 each represents any amino acid residue except
cysteine, 5 and 6 each represents a hydrophobic amino acid
residue, and 9 represents a hydrophilic amino acid residue
except cysteine, each letter being the standard one-letter
symbol for the corresponding amino acid, said peptide
exhibiting a binding affinity for the SH3 domain of Src.

19. The peptide of claim 18 in which 5 is a P or M.

20. The peptide of claim 18 in which 1' is T, P, S or
M.

21. The peptide of claim 18 in which 2' is S or T.

22. The peptide of claim 18 in which 3' is R or S.

23. The peptide of claim 18 in which 10 is T or R.

24. The peptide of claim 1 the binding affinity of
which is at least three-fold greater than that exhibited by
the peptide RPLPPLP for the SH3 domain of Src.

25. The peptide of claim 18 the binding affinity of
which is at least three-fold greater than that exhibited by
the peptide RPLPPLP (SEQ ID NO:9) for the SH3 domain of Src.

26. The peptide of claim 1 the binding affinity of
which is at least four-fold greater than that exhibited by
the peptide RPLPPLP (SEQ ID NO:9) for the SH3 domain of Src.

27. The peptide of claim 18 the binding affinity of
which is at least four-fold greater than that exhibited by
the peptide RPLPPLP (SEQ ID NO:9) for the SH3 domain of Src.

28. The peptide of claim 1 which further exhibits a
general binding affinity for the SH3 domain of Abl, Grb2,
PLC-*, Ras GAP, Nck, p85 PI-3' Kinase, and proteins
related thereto.

29. The peptide of claim 1 which exhibits a selective 
binding affinity for the SH3 domain of Src and Src-related 
proteins.

30. The peptide of claim 18 which further exhibits a
general binding affinity for the SH3 domain of Abl, Grb2,
PLC-δ, PLC-γ, Ras GAP, Nck, p85 PI-3' Kinase, and proteins
related thereto.

31. The peptide of claim 18 which exhibits a selective
binding affinity for the SH3 domain of Src and Src-related
proteins.

32. A peptide having the amino acid sequence
RSTPRPLPMLPTTR (SEQ ID NO: 62).

33. A peptide having the amino acid sequence 
RSTPRPLPPLPTTR (SEQ ID NO: 67). 

34. A peptide having the amino acid sequence
GILAPPVPPRNTR (SEQ ID NO: 63).

35. A peptide having the amino acid sequence 
VLKRPLPIPPVTR (SEQ ID NO: 64). 

36. A peptide having the amino acid sequence
GPHRRLPPTPATR (SEQ ID NO: 65).

37. A peptide having the amino acid sequence 
ANPSPATRPLPTR (SEQ ID NO:66). 

38. A peptide having an amino acid sequence selected
from the group consisting of RSTRPLPILPRTT, STPRPLPMLPTTR,
STNRPLPMIPTTR, RSTRPLPSLPITT, STSRPLPSLPTTR, RSTRSLPPLPPTT,
RSTRQLPIPPTTT, STPRPLPLIPTTP, RSTRPLPPTPLTT, and
RSTRPQPPPPITT (SEQ ID NOS: 85-94).

39. A peptide having the amino acid sequence selected
from the group consisting of VLKRPLPIPPVTR (SEQ ID NO: 64),
YSTRPVPPITRPS (SEQ ID NO: 76), SHKSRLPPLPTRP (SEQ ID NO: 77),
GPHRRLPPTPATR (SEQ ID NO: 65), PATRPLPTRPSRT (SEQ ID NO: 81),
and SGGIIAPPVPPRN (SEQ ID NO: 84).

40. A peptide having the amino acid sequence selected
from the group consisting of PPNSPLPPLPTHL (SEQ ID NO: 72),
TGRGPLPPLPNDS (SEQ ID NO: 74), YRFRALPSPPSAS, LAQRQLPPTPGRD,
ALQRRLPRTPPPA (SEQ ID NOS: 78-80), YSTRPLPSRPSRT, and
XPGRILLLPSEPR (SEQ ID NOS: 82-83).

41. A construct comprising a nucleic acid encoding a 
peptide of claim 1 or its complement. 

42. The construct of claim 41 which is a DNA
polynucleotide.

43. The construct of claim 41 which is a RNA
polynucleotide.

44. A construct comprising a nucleic acid encoding a 
peptide of claim 18 or its complement. 

45. The construct of claim 44 which is a DNA
polynucleotide.

46. The construct of claim 44 which is a RNA 
polynucleotide. 

47. The construct of claim 41 which is a transforming 
vector. 

48. The construct of claim 44 which is a transforming
vector.

49. A host cell transformed with the vector of claim 47.

50. A host cell transformed with the vector of claim 48.

51. A conjugate comprising a peptide of claim 1 and a 
second molecule. 

52. The conjugate of claim 51 in which said second
molecule is selected from the group consisting of an amino
acid, a peptide, a protein, a nucleic acid, a nucleoside, a
glycosidic residue, a label, a drug or a small molecule.

53. A diagnostic kit for the detection of SH3 domains
comprising an SH3 domain-binding peptide and a detectable
label conjugated to said peptide directly, indirectly or by
complexation, said peptide comprising: (i) a core sequence
motif of the formula RXLPφφP (SEQ ID NO: 71), wherein X
represents any amino acid except cysteine and φ represents a
hydrophobic amino acid residue, each letter representing the
standard one-letter designation for the corresponding amino
acid residue; and (ii) two or more additional amino acid
residues flanking said core sequence at its C-terminal end,
N-terminal end or both.

54. A drug delivery system comprising an SH3 domain-
binding peptide and a drug conjugated to said peptide
directly, indirectly or by complexation, said peptide
comprising: (i) a core sequence motif of the formula RXLPφφP
(SEQ ID NO: 71), wherein X represents any amino acid except
cysteine and φ represents a hydrophobic amino acid residue,
each letter representing the standard one-letter designation
for the corresponding amino acid residue; and (ii) two or
more additional amino acid residues flanking said core
sequence at its C-terminal end, N-terminal end or both.

55. The drug delivery system of claim 54 which may be
administered parenterally, orally, enterally, topically or by
inhalation.

56. The drug delivery system of claim 54 which may be
administered intranasally, ophthalmically or intravaginally.

57. The drug delivery system of claim 54 which is in 
the form of a solid, gel, liquid or aerosol. 

58. A method of modulating the activity of Src or
Src-related proteins comprising administering a composition
comprising an effective amount of a peptide of claim 1 and a
carrier.

59. The method of claim 58 which inhibits the activity 
of Src or Src-related proteins. 

60. The method of claim 58 which activates Src or
Src-related proteins.

61. A method of identifying a peptide having a region
that binds to an SH3 domain comprising:

(a) providing an immobilized target protein
comprising an SH3 domain;

(b) incubating said immobilized target protein
with an aliquot taken from a random peptide library;

(c) washing unbound peptide from said immobilized
target protein;

(d) recovering the peptide bound to said
immobilized target protein; and

(e) determining the primary sequence of the SH3
domain-binding peptide.

62. The method of claim 61 in which said library is a
displayed random peptide library.

63. The method of claim 62 in which said library is a
phage-displayed random peptide library.

64. The method of claim 62 in which said library is a
phagemid-displayed random peptide library.

65. The method of claim 61 in which step (c) includes
washing unbound phage from said immobilized target protein;
step (d) includes recovering the phage bound to said
immobilized target protein; and step (e) includes determining
the relevant nucleotide sequence of said binding phage
nucleic acid, from which the primary sequence corresponding
to the SH3 domain-binding peptide is deduced.
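
The deduction recited in step (e) of claim 65, going from a sequenced phage insert to the primary sequence of the displayed peptide, amounts to translating the insert with the standard genetic code. The following Python sketch shows one way to do that. The function name and the example DNA string are hypothetical and are not taken from the specification, and the sketch assumes the insert has already been trimmed to the correct reading frame.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, one letter per codon in the order TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO)
}


def deduce_peptide(insert_dna: str) -> str:
    """Translate an in-frame DNA insert into one-letter amino acid symbols.

    Stop codons are reported as '*'.
    """
    dna = insert_dna.upper().replace(" ", "")
    if len(dna) % 3:
        raise ValueError("insert length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
```

For example, the hypothetical insert CGTCCGCTGCCGCCGCTGCCG translates to RPLPPLP; many other inserts encode the same peptide because the genetic code is degenerate.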

66. A method of identifying a peptide having a region
that binds to an SH3 domain comprising:

(a) providing an immobilized target protein
comprising an SH3 domain;

(b) incubating said immobilized target protein
with an aliquot taken from a phage-displayed random peptide
library, which library includes peptides having a random
sequence of ≥8 amino acid residues;

(c) washing unbound phage from said immobilized
target protein;

(d) recovering the phage bound to said immobilized
target protein; and

(e) determining the relevant nucleotide sequence
of said binding phage nucleic acid and deducing the primary
sequence corresponding to the SH3 domain-binding peptide.

67. The method of claim 66 which further comprises
amplifying the titer of the recovered phage.

68. The method of claim 66 which further comprises
repeating the steps of incubation, washing and recovery to
provide SH3 domain-binding peptide-enriched phage.

69. A pharmaceutical composition comprising an SH3
domain-binding peptide and a pharmaceutically acceptable
carrier, said peptide comprising: (i) a 9-mer sequence motif
of the formula RXLPφφPXψ (SEQ ID NO: 10), wherein X represents
any amino acid except cysteine, φ represents a hydrophobic
amino acid residue, and wherein ψ is a hydrophilic amino acid
residue except cysteine, each letter representing the
standard one-letter designation for the corresponding amino
acid residue; and, optionally, (ii) additional amino acid
residues flanking said 9-mer sequence at its C-terminal end,
N-terminal end or both, up to a total of 45 amino acid
residues, including said 9-mer sequence.

70. The composition of claim 69 in which at least one 
additional amino acid flanks said 9-mer sequence. 

71. The composition of claim 69 in which at least two 
additional amino acids flank said 9-mer sequence. 

72. The composition of claim 69 in which at least three
additional amino acids flank said 9-mer sequence.

73. A method of disrupting protein tyrosine kinase- 
mediated signal transduction pathways comprising 
administering an effective amount of a peptide of claim 1. 

74. A method of regulating the processing, trafficking
or translation of RNA by administering an effective amount of
a peptide of claim 1.



FIG. 1

                         Pro.Gly
GGC.TCG.AGN.(NNB)18.CCA.GGT        (contains Xho I site)
GGT.CCA.(NNV)18.AGA.TCT.GG         (contains Xba I site)

N = A,C,G,T
B = C,G,T
V = A,C,G

FILL IN WITH Taq DNA POLYMERASE

CLEAVE WITH Xho I AND Xba I

TCG.AGN.(NNB)18.CCA.GGT
GGT.CCA.(NNV)18.AGA.TC

LIGATE WITH Xho I + Xba I-CLEAVED M13 m663 VECTOR

ELECTROPORATE INTO E. coli DH5αF'

TSAR-9 LIBRARY
OF pIII-RANDOM SEQUENCE FUSION PROTEINS

...S H S | S (R/S) X18 P G X18 S R P S R T...
(| = SIGNAL PEPTIDASE CLEAVAGE SITE)
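
The degenerate base codes given in the key of FIG. 1 (N = A,C,G,T; B = C,G,T; V = A,C,G) can be expanded mechanically. The short Python sketch below enumerates the codons covered by the NNB scheme and checks which stop codons of the standard genetic code remain possible. The helper name `expand` is hypothetical and is not part of the specification.

```python
from itertools import product

# Degenerate base codes exactly as listed in the key of FIG. 1.
DEGENERATE = {"N": "ACGT", "B": "CGT", "V": "ACG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code


def expand(pattern: str) -> list[str]:
    """Return every concrete sequence matching a degenerate pattern."""
    choices = [DEGENERATE.get(base, base) for base in pattern]
    return ["".join(combo) for combo in product(*choices)]


nnb_codons = expand("NNB")
nnb_stops = sorted(set(nnb_codons) & STOP_CODONS)
```

With these definitions NNB expands to 48 codons, of which only TAG is a stop codon under the standard genetic code, because TAA and TGA both end in A.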